1 TCGCGCCTTT CACCGGCACC TTGCGTCGGT CGCGCCGCGG GGCCTGCTCC 

51 TGCCGCGCGC ACCCCCGGGG CTTCGGCTCC GGCACGGGTC GCGCCCAGCT 

101 TTCCTGCACC TGAGGCCGCC GGCCAGCCGC CGCCATGGGT GCCTACCTCT 

151 CCCAGCCCAA CACGGTGAAG TGCTCCGGGG ACGGGGTCGG CGCCCCGCGC 

201 CTGCCGCTGC CCTACGGCTT CTCCGCCATG CAAGGCTGGC GCGTCTCCAT 

251 GGAGGATGCT CACAACTGTA TTCCTGAGCT GGACAGTGAG ACAGCCATGT 

301 TTTCTGTCTA CGATGGACAT GGAGGGGAGG AAGTTGCCTT GTACTGTGCC 

351 AAATATCTTC CTGATATCAT CAAAGATCAG AAGGCCTACA AGGAAGGCAA

401 GCTACAGAAG GCTTTAGAAG ATGCCTTCTT GGCTATTGAC GCCAAATTGA 

451 CCACTGAAGA AGTCATTAAA GAGCTGGCAC AGATTGCAGG GCGACCCACT 

501 GAGGATGAAG ATGAAAAAGA AAAAGTAGCT GATGAAGATG ATGGGGACCA 

551 CTTCTATAAG AGAAACAAGA ACCTGCCACC TGAGGAACAG ATGATTTCAG 

601 CCCTTCCTGA CATCAAGGTG CTGACTCTCA CTGACGACCA TGAATTCATG 

651 GTCATTGCCT GTGATGGCAT CTGGAATGTG ATGAGCAGCC AGGAAGTTGT 

701 AGATTTCATT CAATCAAAGA TCAGCCAGCG TGATGAAAAT GGGGAGCTTC 

751 GGTTATTGTC ATCCATTGTG GAAGAGCTGC TGGATCAGTG CCTGGCACCA 

801 GACACTTCTG GGGATGGTAC AGGGTGTGAC AACATGACCT GCATCATCAT 

851 TTGCTTCAAG CCCCGAAACA CAGCAGAGCT CCAGCCAGAG AGTGGCAAGC 

901 GAAAACTAGA GGAGGTGCTC TCTACTGAGG GGGCTGAAGA AAATGGCAAC 

951 AGCGACAAGA AGAAGAAGGC CAAGCGAGAC TAGCAGTCAT CCAGACCCCT 

1001 GCCCACCTAG ACTGTTTTCT GAGCCCTCCG GACCTGAGAC TGAGTTTTGT 

1051 CTTTTTCCTT TAGCCTTAGC AGTGGGTATG AGGTGTGCAG GGGGAGCTGG 

1101 GTGGCTTCAC TCCGCCCATT CCAAAGAGGG CTCTCCCTCC ACACTGCAGC 

1151 CGGGAGCCTC TGCTGTCCTT CCCAGCCGCC TCTGCTCCTC GGGCTCATCA

1201 CCGGTTCTGT GCCTGTGCTC TGTTGTGTTG GAGGGAAGGA CTGGCGGTTC 

1251 TGGTTTTTAC TCTGTGAACT TTATTTAAGG ACATTCTTTT TTATTGGCGG 

1301 CTCCATGGCC CTCGGCCGCT TGCACCCGCT CTCTGTTGTA CACTTTCAAT 

1351 CAACACTTTT TCAGACTAAA GGCCAAAACC TAATCGTTAA AAAAAAAAAA 

1401 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAA (SEQ ID NO:1)



FEATURES:

5' UTR: 1-134 
Start: 135 
Stop: 981 
3' UTR: 984 
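
The coordinates above can be sanity-checked with a short script. The following is a minimal sketch, assuming that "Start" and "Stop" give the 1-based position of the first base of the start and stop codons; the helper names (translate, cds_span) are illustrative and not part of the original listing. Only the first 36 and last 15 bases of the coding region are copied from SEQ ID NO:1, to keep the example short.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def cds_span(start, stop):
    """Return (length in bases incl. stop codon, in-frame flag, residue count).

    Assumes start/stop are the 1-based first bases of the start/stop codons.
    """
    length = stop + 3 - start
    return length, length % 3 == 0, length // 3 - 1


# Coordinates taken from the FEATURES block (Start: 135, Stop: 981).
print(cds_span(135, 981))  # (849, True, 282)

# Bases 135-170 and 969-983 of SEQ ID NO:1, copied from the listing.
cds_head = "ATGGGTGCCTACCTCTCCCAGCCCAACACGGTGAAG"
cds_tail = "GCCAAGCGAGACTAG"
print(translate(cds_head))  # MGAYLSQPNTVK
print(translate(cds_tail))  # AKRD*
```

Under these assumptions the open reading frame is in frame and encodes 282 residues, which agrees with the length and the first and last residues of the protein listed as SEQ ID NO:2.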



Homologous proteins: 

Top 10 BLAST Hits 



                                                                 Score  E
CRA|18000005093929 /dataset=nraa /length=546 /altid=gi|4505999    301   6e-81
CRA|18000005157254 /dataset=nraa /length=542 /altid=gi|6679793    284   1e-75
CRA|18000005062172 /dataset=nraa /length=543 /altid=gi|3122591    283   1e-75
CRA|1000682325310  /dataset=nraa /length=356 /altid=gi|7508306    161   1e-38
CRA|18000004879328 /dataset=nraa /length=348 /altid=gi|1352682    161   1e-38
CRA|18000004973158 /dataset=nraa /length=370 /altid=gi|1171963    154   9e-37
CRA|18000005006718 /dataset=nraa /length=414 /altid=gi|1171964    154   1e-36
CRA|18000004973159 /dataset=nraa /length=414 /altid=gi|1076914    152   3e-36
CRA|89000000194666 /dataset=nraa /length=352 /altid=gi|7291977    151   8e-36
CRA|18000004972853 /dataset=nraa /length=464 /altid=gi|6320934    145   4e-34
CRA|89000000203611 /dataset=nraa /length=662 /altid=gi|7302240    141   9e-33
CRA|18000004915558 /dataset=nraa /length=468 /altid=gi|6319415    140   2e-32
CRA|18000005051471 /dataset=nraa /length=468 /altid=gi|1622933    140   2e-32

BLAST to dbEST:

                                                                    Score  E
gb|BF341923|BF341923 602016449F1 NCI_CGAP_Brn64 Homo sapiens cD.     878   0.0
gb|AW370275|AW370275 RC1-BT0254-131199-013-f12 BT0254 Homo sapi.     844   0.0
gb|BE305043|BE305043 601186706F1 NIH_MGC_15 Homo sapiens cDNA c.     835   0.0
gb|BE249868|BE249868 600942906F1 NIH_MGC_15 Homo sapiens cDNA c.     835   0.0
gb|AI417892|AI417892 tg55b01.x1 NCI_CGAP_Pr28 Homo sapiens cDNA.     823   0.0
gb|BE896031|BE896031 601438825F1 NIH_MGC_72 Homo sapiens cDNA c.     815   0.0
gb|AI686098|AI686098 tt92a01.x1 NCI_CGAP_Pr28 Homo sapiens cDNA.     815   0.0
gb|AU143285|AU143285 AU143285 Y79AA1 Homo sapiens cDNA clone Y7.     813   0.0
gb|AU142966|AU142966 AU142966 Y79AA1 Homo sapiens cDNA clone Y7.     813   0.0
gb|BE903477|BE903477 601676724F1 NIH_MGC_21 Homo sapiens cDNA c.     813   0.0
gb|BE891121|BE891121 601432236F1 NIH_MGC_72 Homo sapiens cDNA c.     813   0.0
gb|BE886682|BE886682 601507935F1 NIH_MGC_71 Homo sapiens cDNA c.     813   0.0
gb|BE797128|BE797128 601587118F1 NIH_MGC_7 Homo sapiens cDNA cl.     813   0.0
gb|BE796239|BE796239 601591808F1 NIH_MGC_7 Homo sapiens cDNA cl.     813   0.0
gb|BE793727|BE793727 601588924F1 NIH_MGC_7 Homo sapiens cDNA cl.     813   0.0
gb|BE781616|BE781616 601467472F1 NIH_MGC_67 Homo sapiens cDNA c.     813   0.0

EXPRESSION INFORMATION FOR MODULATORY USE:

library source:

Expression information from BLAST dbEST hits:

gb|BF341923|BF341923 Brain
gb|AW370275|AW370275 Breast
gb|BE305043|BE305043 Colon adenocarcinoma
gb|BE249868|BE249868 Colon adenocarcinoma
gb|AI417892|AI417892 Prostate
gb|BE896031|BE896031 Skin melanotic melanoma
gb|AI686098|AI686098 Prostate
gb|AU143285|AU143285 Eye-retinoblastoma
gb|BE903477|BE903477 Placenta Choriocarcinoma
gb|BE891121|BE891121 Skin-melanotic melanoma
gb|BE886682|BE886682 Uterus-leiomyosarcoma
gb|BE797128|BE797128 Lung small cell carcinoma
gb|BE796239|BE796239 Lung small cell carcinoma
gb|BE793727|BE793727 Lung small cell carcinoma
gb|BE781616|BE781616 Eye-retinoblastoma



Expression information from PCR-based tissue screening panels:
Human leukocyte

1 MGAYLSQPNT VKCSGDGVGA PRLPLPYGFS AMQGWRVSME DAHNCIPELD

51 SETAMFSVYD GHGGEEVALY CAKYLPDIIK DQKAYKEGKL QKALEDAFLA 

101 IDAKLTTEEV IKELAQIAGR PTEDEDEKEK VADEDDGDHF YKRNKNLPPE 

151 EQMISALPDI KVLTLTDDHE FMVIACDGIW NVMSSQEVVD FIQSKISQRD 

201 ENGELRLLSS IVEELLDQCL APDTSGDGTG CDNMTCIIIC FKPRNTAELQ 

251 PESGKRKLEE VLSTEGAEEN GNSDKKKKAK RD (SEQ ID NO: 2) 

FEATURES:

Functional domains and key regions:

[1] PDOC00001 PS00001 ASN_GLYCOSYLATION
N-glycosylation site

233-236 NMTC

[2] PDOC00005 PS00005 PKC_PHOSPHO_SITE
Protein kinase C phosphorylation site

Number of matches: 4

1 10-12 TVK
2 197-199 SQR
3 253-255 SGK
4 273-275 SDK

[3] PDOC00006 PS00006 CK2_PHOSPHO_SITE
Casein kinase II phosphorylation site

Number of matches: 10

1 38-41 SMED
2 57-60 SVYD
3 106-109 TTEE
4 122-125 TEDE
5 164-167 TLTD
6 184-187 SSQE
7 197-200 SQRD
8 210-213 SIVE
9 224-227 TSGD
10 229-232 TGCD

[4] PDOC00008 PS00008 MYRISTYL
N-myristoylation site

Number of matches: 2

1 2-7 GAYLSQ
2 266-271 GAEENG

[5] PDOC00009 PS00009 AMIDATION
Amidation site

253-256 SGKR

[6] PDOC00792 PS01032 PP2C
Protein phosphatase 2C signature

55-63 MFSVYDGHG
FIGURE 




BLAST Alignment to Top Hit: 

>CRA|18000005093929 /dataset=nraa /length=546 /altid=gi|4505999
/def=ref|NP_002698.1| protein phosphatase 1G (formerly
2C), magnesium-dependent, gamma isoform; protein
phosphatase 1G (formerly 2C), ; protein phosphatase 2,
catalytic subunit, gamma isoform [Homo sapiens]
/org=Homo sapiens /taxon=9606
Length = 546

Score = 301 bits (763), Expect = 6e-81

Identities = 146/146 (100%), Positives = 146/146 (100%)

Query: 137 GDHFYKRNKNLPPEEQMISALPDIKVLTLTDDHEFMVIACDGIWNVMSSQEVVDFIQSKI 196
           GDHFYKRNKNLPPEEQMISALPDIKVLTLTDDHEFMVIACDGIWNVMSSQEVVDFIQSKI
Sbjct: 401 GDHFYKRNKNLPPEEQMISALPDIKVLTLTDDHEFMVIACDGIWNVMSSQEVVDFIQSKI 460

Query: 197 SQRDENGELRLLSSIVEELLDQCLAPDTSGDGTGCDNMTCIIICFKPRNTAELQPESGKR 256
           SQRDENGELRLLSSIVEELLDQCLAPDTSGDGTGCDNMTCIIICFKPRNTAELQPESGKR
Sbjct: 461 SQRDENGELRLLSSIVEELLDQCLAPDTSGDGTGCDNMTCIIICFKPRNTAELQPESGKR 520

Query: 257 KLEEVLSTEGAEENGNSDKKKKAKRD 282
           KLEEVLSTEGAEENGNSDKKKKAKRD
Sbjct: 521 KLEEVLSTEGAEENGNSDKKKKAKRD 546 (SEQ ID NO:4)

Score = 284 bits (718), Expect = 1e-75

Identities = 137/139 (98%), Positives = 138/139 (98%)

Query: 1   MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSMEDAHNCIPELDSETAMFSVYD 60
           MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSMEDAHNCIPELDSETAMFSVYD
Sbjct: 1   MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSMEDAHNCIPELDSETAMFSVYD 60

Query: 61  GHGGEEVALYCAKYLPDIIKDQKAYKEGKLQKALEDAFLAIDAKLTTEEVIKELAQIAGR 120
           GHGGEEVALYCAKYLPDIIKDQKAYKEGKLQKALEDAFLAIDAKLTTEEVIKELAQIAGR
Sbjct: 61  GHGGEEVALYCAKYLPDIIKDQKAYKEGKLQKALEDAFLAIDAKLTTEEVIKELAQIAGR 120

Query: 121 PTEDEDEKEKVADEDDGDH 139
           PTEDEDEKEKVADEDD D+
Sbjct: 121 PTEDEDEKEKVADEDDVDN 139 (SEQ ID NO:5)
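
The "Identities" figure of an alignment segment can be re-derived by comparing the aligned rows column by column. The following is a minimal sketch; the function name count_identities is illustrative, and the two strings are the final block (residues 121-139) of the alignment above. The first 120 aligned residues are identical in the listing, so they are added as a constant rather than retyped.

```python
from typing import Tuple


def count_identities(query: str, subject: str) -> Tuple[int, int]:
    """Return (identical columns, aligned columns) for two aligned rows.

    Both rows must have the same length; a gap character '-' never
    counts as an identity.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    identical = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return identical, len(query)


# Last block of the second segment (Query 121-139 vs. Sbjct 121-139).
query_block = "PTEDEDEKEKVADEDDGDH"
sbjct_block = "PTEDEDEKEKVADEDDVDN"

identical, columns = count_identities(query_block, sbjct_block)
print(identical, columns)                # 17 19

# Residues 1-120 are identical in the listing, so add them in.
print(120 + identical, 120 + columns)    # 137 139
```

The two mismatching columns (G/V and H/N) account for the 137/139 identities reported for this segment.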



>CRA|18000005157254 /dataset=nraa /length=542 /altid=gi|6679793
/def=ref|NP_032040.1| fibroblast growth factor inducible
13 [Mus musculus] /org=Mus musculus /taxon=10090
Length = 542

Score = 284 bits (718), Expect = 1e-75

Identities = 139/146 (95%), Positives = 141/146 (96%), Gaps = 1/146 (0%)

Query: 137 GDHFYKRNKNLPPEEQMISALPDIKVLTLTDDHEFMVIACDGIWNVMSSQEVVDFIQSKI 196
           GDHFYKRNKNLPP+EQMISALPDIKVLTLTDDHEFMVIACDGIWNVMSSQEVVDFIQSKI
Sbjct: 398 GDHFYKRNKNLPPQEQMISALPDIKVLTLTDDHEFMVIACDGIWNVMSSQEVVDFIQSKI 457

Query: 197 SQRDENGELRLLSSIVEELLDQCLAPDTSGDGTGCDNMTCIIICFKPRNTAELQPESGKR 256
           SQRDENGELRLLSSIVEELLDQCLAPDTSGDGTGCDNMTCIIICFKPRNT ELQ ESGKR
Sbjct: 458 SQRDENGELRLLSSIVEELLDQCLAPDTSGDGTGCDNMTCIIICFKPRNTVELQAESGKR 517

Query: 257 KLEEVLSTEGAEENGNSDKKKKAKRD 282
           KLEE LSTEGAE+ GNSD KKKAKRD
Sbjct: 518 KLEEALSTEGAEDTGNSD-KKKAKRD 542 (SEQ ID NO:6)

Score = 279 bits (706), Expect = 3e-74

Identities = 133/139 (95%), Positives = 138/139 (98%)

Query: 1   MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSMEDAHNCIPELDSETAMFSVYD 60
           MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSMEDAHNCIPELD+ETAMFSVYD
Sbjct: 1   MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSMEDAHNCIPELDNETAMFSVYD 60

Query: 61  GHGGEEVALYCAKYLPDIIKDQKAYKEGKLQKALEDAFLAIDAKLTTEEVIKELAQIAGR 120
           GHGGEEVALYCAKYLPDIIKDQKAYKEGKLQKAL+DAFLAIDAKLTTEEVIKELAQIAGR
Sbjct: 61  GHGGEEVALYCAKYLPDIIKDQKAYKEGKLQKALQDAFLAIDAKLTTEEVIKELAQIAGR 120

Query: 121 PTEDEDEKEKVADEDDGDH 139
           PTEDED+K+KVADEDD D+
Sbjct: 121 PTEDEDDKDKVADEDDVDN 139 (SEQ ID NO:7)



Hmmer search results (Pfam):

Model    Description             Score  E-value  N
PF00481  Protein phosphatase 2C  176.2  4.2e-49  2
PF01722  BolA-like protein         3.7  9.5      1

Parsed for domains:

Model    Domain  seq-f  seq-t     hmm-f  hmm-t     score  E-value
PF00481  1/2        25    102 ..      1     95 [.   74.1  4.6e-20
PF01722  1/1        99    110 ..     76     87 .]    3.7  9.5
PF00481  2/2       137    228 ..    197    301 .]  102.2  4.5e-28

1 AAAGAATCTT TTTTTTTTTT TTGAGACGGA GTTGCTCTGT CACCCAGGGT

51 GGAGTGCAGT GGCGCCATCT TGGTTCACTG CAACCTCCGC CTCTGGGGTT 

101 CAAGCTATTC GCCTGCCTTA GCCTCCCAAG TAGCTGGGAT TACAGGAGCG 

151 CACCACTACG CCTGGCTAAT TTTTGTATTT TTAGTAGAGA CGGGTTTCAC 

201 ATGTTGGCCA GGCTGGTCTC GAACTTCTGG CCTCAAGTGA TCCACCACCC 

251 CCCCTTGGCC TCCCAAAGTG CTGGGATTAC AAGTGTGAGC CACTGTGCCC 

301 GGCTGAAAAG AATCAATTTT GTCATAGTTT GGAGAATTTC TCCTTTTCTC 

351 TCCATCCCTT GAATGCAATT TATTACCAAA TCTGTCTTAT TTGTTATTGT 

401 CTAATTTGTC CTTTCATCTG GATTCCCATT GCCACCCTGC GTGGTACCAC 

451 CTTACTCCCA GCTCTTCTCA TCTCCTGCTT AGAGTAAGAG CTCTCTAACT 

501 AGTAGCAGTG CCCCAGGCCA GGCGCGGTGG CTCACGCCTG TAATCCCAGC 

551 ACTTTGGGAG GCTGAGGCGG GTAGATCACG AGGTCAGGAG TTCGAGACCA 

601 GCCTGGCCAA CATGGTGAAA CCCCGTCTCT ACTAAAAATA CAAAAATTAG 

651 CCAGGCGTGG AATCCTAGCT ACTCGGGAGG CTGAGGCAGG AGAATCGCTT 

701 GAACCTGGGA GGCGGAGGTT GCAGTGAGCC AAGATAGCGC CACTGCACTG 

751 CAGCCTGGGC AACAAGAGCG AAACTCTGTC TTAAAAAAAA AACAATAGTA 

801 GGCGGGGTGC GGTGGCTCAC GCCTGTAATC CCAACACTTT GGGAGGCCGA 

851 GGCGGGCGGA TCACGAGGTC AGGAGATGGA GACCATCCTG GCTAACACGG 

901 TGAAACCTCG TCTCTACTAA AAATACAAAA AATTAGCCAG GCGTGGTGGC 

951 GGGCGCCTGT AGTCCCAGCT ACTCGGGAGG CTGAGGCAGG AGAATGGCGT 

1001 GAACCCGGGA GGCGGAGCTT GTAGCCTGGG CGACAGAGCG AGACTCTGTC 

1051 TCAAAAACAA ACAAAAAACA ACAACACAAC AGTGCCCCAG ACTCTCTCCC 

1101 TCCAATGTAC ACTGCATACA AAGACTAGAC AAACGATGCC AAAGGTTACA

1151 CCAGGACGAC AATGAAGTCC AAGTCACTCA TCTTGGCGCT TCTTTTTATC 

1201 AAGCTAGCTA GTTAATATTA CCACTTACAA TCATTCTCCG AGTCCAGCCA 

1251 GCTTCCTCAG AAATCCTCCC CAAATGCAGT TCACATTCTT CCCTCACTCC 

1301 TCCAAACCTT ATAACATTAT TCCCTTTTCC TGTGGCATAA TGCAATCCAG 

1351 AGGCATCCTC TCTGAGAGCC TACATCCTTT CAATCCTCCA AGAAACAGCT 

1401 CCTCCTCCCA TCCTTGAGCT TTTCCCACCC AGAATAGGCT GTACCAAACA 

1451 TTTCTACATT GTATTAACTC GAATTATATC ACAGATCGAG GTCTTCGCTT 

1501 TCAGAAAAGA CTCACATTCT TCTCATATAG GCCTCGCATG GCCTGGCCAA

1551 GTACTTCCTA AGCTCGGAAC AAATACTGGT CAACTTTAAT TGAACCAAAT 

1601 CGGGCGGGGT TTGCGGAGTC TGAGAGTGCA ACGTTGGGGA GAGGGGGATG 

1651 AAAACACTGG AGGACGGGCT GAAAGCGTCG AGTCCGACAC AAAAGAGGCG

1701 TCAGACAAAA CGCCAAGAGG CTGGGGACTG GGAACGAAGG AAGAGGTTCT

1751 GCCAGAGGCG ACCTGCCACC TGCGCGAGGA AGCGGAGTAG GACGGCGGCC 

1801 GTTGGTGGGC GTGGTCGCGC TAGTCTCGCG GGAGCGGCCG TTGGGCGGGC 

1851 CGTTGTCCCC TGCGGGCGGG GCGAGTTGCT AAGGAAATGA CTGCCCGCAG 

1901 CGCCTGGCCC CGCCGCGCAG GCCGGGCGGG GTCTGGAGCG GCGCCGTTTC 

1951 CGCTTCCGCT CCCTCACAGC TCCCGTCCCG TTACCGCCTC CTGGCCGGCC 

2001 TCGCGCCTTT CACCGGCACC TTGCGTCGGT CGCGCCGCGG GGCCTGCTCC 

2051 TGCCGCGCGC ACCCCCGGGG CTTCGGCTCC GGCACGGGTC GCGCCCAGCT 

2101 TTCCTGCACC TGAGCCGCCG GCCAGCCGCC GCCATGGGTG CCTACCTCTC 

2151 CCAGCCCAAC ACGGTGAAGT GCTCCGGGGA CGGGGTCGGC GCCCCGCGCC 

2201 TGCCGCTGCC CTACGGCTTC TCCGCCATGC AAGGCTGGCG CGTCTCCATG 

2251 GAGGTGAGGA GGCAGGGGCC CATAGGCTGG CCGCTGCGGG GCGGGAATCT 

2301 GACGGAGAAA GAGAGCGGGG GATGGGGTCC TCCCCTGGGA AGGGTCCCAA 

2351 TTGGGAGCCT GCGGCCGCAG CGGCCGTTTG CGGGGCGACA GAGACCGCGG 

2401 GGTCAGGGCC GAGAGGGAGC TCCCGATGCT TGGGGACCGT GTGCCGGTGG 

2451 CCAGTGGCGG CGAGGGCTCG GCCATGTGGG AAGAGGCACC TTCCGCCCAC

2501 TGACCGCCCT CTCCCCGAGC TTTGGCGCCA TCCTCTCGTG CCAACCTAGC 

2551 CCTCCAGGCT CATCAACGGT GTGAGGTTTA GTGTGGGAGT AAAGACGCAA 

2601 ATAGGGGCTA TTTATTCATT TGTTTTTCAA GGGAGAGGTG AATAGATGTG

2651 AATAACTTTT TAAATTTTAA TATTTAAAAT ATCTGATGTG GGAAGCCTCT 

2701 TTTGGCTAGG AGTTTGACAG TGAAAGGAAC CCCGGGCAGA GTCTGTTTCA 

2751 CATTTTGGTT GCCTGGCCTT GGGCTCTTGC ATGTTAATTT CAGAGGCTGG 

2801 ACCCGACCTC CAGGAGTTGT CACTCATTTG CACTCTTTTC AGGGCCTTTT 

2851 ACTAACTTCG GAAAACTTGA ATTATGTCAG TCCCTAGGTT TTCCTTTTTA 

2901 TTATGCTTCG TTTTCTTTCT CTCTTAGGAT TTCTCTAAAA CTTAATCAGT 

2951 AATTCTCATT TGCCTGTAAT TGTAGTTTAT AGTTTCTTTG ATTGTAAGTC 

3001 ATTTCATCAA TTTTTTTCAT CACAACCTAC CTACAAAGGG CTTTTCTAGA 

3051 AAATTTTACT CTGGACAAAA GGGGAAAAGA AAAATATTGG GGGGAAAGTA 

3101 GTAGTATTAG GTAAAACTTG ATGTGAAACT ACAAAAGAGA AGAGGGAAAA

3151 CTGCGGTAGG GAGGAAAGGG AGGAAGACGG GTTAACCGTG GCTTTGTGAA

3201 GAGCATTCTG AAGTCTAGGC AAAAGGGCCA GGGAAATACT CTGTCTGGTA 

3251 TTGAGGGTTT CTCCACCTAC CGGGTGGGCT TCAGGTAACA GCGAAATACT 

3301 GTCTCCCTTG GGAATTGTTT CAGATCCCTC GCTCCTCCTG TGGTTAGCTC 

3351 TGGAATGCCA GTATGAACTT CAATGTTTTG TTTTCCGATT CAAATTTTAT 

3401 ATTCATAACT GACCTTAATA ACAATTTTAC AATTAGGTAT AAAATTTCAG

3451 GATCCTAGTG TATCCTATAG TTCATCTCAT CTGCTTTGGC TCCCTTTTTT

3501 TTTTTTTGAG ACAGAATTTC GCTCTTGTTG CCCAGGCTGG AATGCAATGG 

3551 CGCGATCTCG GCTCACCGCA ACCTCCGCCT CCCAGGTTGG AGCAATTCTC 

3601 CTGCCTCAGC CTCCATAGTA GTTGGGATCA CAGGCATGTG CCACCACGCG 

3651 TGGCTTATTT TGTATTTTTA GTAGAGACAG GGTTTCTTCA TGTGGGTCAG 

3701 GCTGGTGTTG AACTCCTGAC TTAGGTGATC TGCCTGCCTC GGTCTCCCAA 

3751 AGTGCTACCA CCACNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

3801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

3851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

3901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

3951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

4451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

4651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

4701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

4801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

4851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

4951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

5001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

5501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

6001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

6051 NNNNNNNNNN NNNNNNNNNN NNNNNTGGGA TTACAGACAT GAGCCACTGC

6101 GCCCAGCCTT ATTTAGAAAT TTCTTTAGTG AAAGATGATA AATTTTCAGT 

6151 TTTTCATTAT CTGAACATGT TTTTATCTAG CCTTTGTTCT GAAAAGATGC 

6201 TTGGACTCAG TACCCAGTTC TAGATTGACA GTTAATTTTT CTTAATTTGT 

6251 AAATGTTGTT TCATTGATTG ACTTCCATTG TTGTTCGGAA AAATTTATCA 

6301 TCAGCCATTT CTGACTTTTG ATCTGTGTTT TCTCTTTGGT TTCTCTTTTT

6351 TTTTTTCTTT TTTTTTTTTT GAGACGGAGT GTCGCTCTGT TGCCCAGGCT 

6401 GGAGTGCAGT GGCATGATCT TGGCTCACTG CAACCTCTGC CTCCTGAGTT

6451 CAAGCGATTC TCCTGCCTCA GCTTCCCGAG TAGCTGGGAT TACAGGCGCC 

6501 TGCCACCATG CCCGGCTAAT TTTTTGTATT TTCAATAGAG ACAGGGTTTC 

6551 ACTATGTTGG CCGGGTTGGT CTCCAACTCC TGACCTCTTA ATCCGCCCGC 

6601 CTCGGCCTCC CAAAGTGCTG GCATTACAGG CGTGAGCCAC CATGCCTGGC 

6651 CCATCAGTTG ATGTAGTCTT AAGGGGACAA GAGTACATTT AATATTTGGT 

6701 TGTGAGGTTC TCTGGAAGTG ACAAAACTGC TTTCTATGGA GAGTTAGGAA 

6751 TTTTTTTTTT TTTTGAAACG GAGTCTCGCA TTGTCACCCG GGCTGGAGTG 

6801 TAGTGGCTTG ATCTCGGCTC ACTGAAACTT CCGCCTCTTG GGTTCAAGTG 

6851 ATTCTCCTGC CTCAGCCTTC CAAATAGCTG GGATTACAGG AGTCTGCCAC 

6901 CAGGCCAGCT AATTTTTTTT TGTATTTTTA GTGGAGACAG GATTTCACTA 

6951 TGTTGGCCAG GCTGGTCTCA AGACTCCTGA CGTTGTGATC CACCTGCCTC 

7001 GGCCTCCCAA AGTGCTGGGA TTACAGATGT GAGCTACCGT GCCCGGCCAG

7051 GAATTTTTTG TGCTATAAAT CATATTTTCC TTTTATTAAA GGCAGTGTCA

7101 ATATCTATAG TATAATTTTG AGGAGGCTGG CTATTTATTG CTGTGTAGAA 

7151 GCTGGCTTAT TAGTGGTCAA GGGGTCATCT AGAATTGACT ATAAAGATAG 

7201 TATTGAGCAG AAAATTCTTA AAATACCTGC ATATTAGTTT CAGTCATTAA 

7251 ATTAATGGAA AAAATATAAA AAGAAATATC ACAAGTATGC TATGGGGTTC 

7301 TACCTTAGGG CATTGAAGGT TGAAAAACAT TTATTTTCTT ATCTTCAGAT

7351 TAGCATCTCA TATCAATAGC CAATAGCTTA AAGCGCTTTT TACTTACTAA

7401 ACCAGGTCAG AATTTCTCTC TCTCTCTCTT TTTTTTTTTT AGACAGAGTC 

7451 TCTGTCACCC AGTCTAGAGT GCAGTGGTGT GATCTCGGCT CACTGGCCTT 

7501 TCTGTTTAAG TGATTCTCCG GCCTCAGCCT CCTGAGTAGC TGGGATTACA

7551 GGCATGTGCC ACGACACTTG GCTTATTTTT TGTATTTTTA GTAGAGATGG

7601 GGTTTCTCTG TGTTGGTCAG GCAGGTCTTG AACTCCTGAC CTCAGGTGAT

7651 CTGCCCGCCT CCGCCTCCCA AAGTGCTGGG ATTACAGACG TGAGCTACTG

7701 CGCCTGGCCA GAATTTCTTT GTCTAGAATG TGGTTAGCAA CTTTTATAAA

7751 AACGCATTAT TTGCATTTGA TTAGCATGCA GTACCCATTC ACAGTTCAAA 

7801 GCTAGTATAG AATTATATCA CATGTATGCC CATGAGCATG GAGAAACTAT

7851 TTTCTTTTTA TTTTTTTAAG TTGGAGTTTT GCTCTTGTTG CCCAGGCTGG 

7901 AGTGCAATGG TGCCATCTCG GCTCACTGCA GCTTCTGCCT CCTGGGTTCT

7951 AGCAATTTGC CTACCCCAGC CTCCCAAGTA GCTGGGATTA CAGGCACTCG
8001 CCACCATGCC CAGCTAACTT TTTTGTATTT TTAGTAGAGA AGGGGTTTTC
8051 CCATGCTGGC GAGTCTGGTC TTGAACTTCT GGCCTCAAGT GATCTGCCCG 
8101 CCTCAGCCTC CCAAAGTGCT GGAATTACAG GCATGAGCCA CTGTGCCCGG 
8151 CCTTTTTATT TTTTAAATTA TTTATGTATT TATTTTGAGA CAGGATCTCA 
8201 CTCTTGCCCA TGCTTGAGTG GTAAGGAGTA TGGGATTTCT TGTGCCGGTT 
8251 CCTTCACATC CTCACTACAC TTATCTGCCT TTACAGTGGC TCGATCATAG 
8301 TTCACTGCAT AGCCTTCTGG GCTCAAGGGG TCTTCCAGCC TCAGCCTAAT 
8351 ATAGGCACAT GCCACCATGA CTGGCTAATT TTTTTTTTTT AAGTTTTTTT 
8401 TTGTAGAGAT AGGGCCTTGC AGTGTTGCCC AGGCTGAGGA ATTTTATTTA 
8451 TGTTTATTTT ATTTATTTAT TTATTTATTT ATTTATTTAT TTATTTATTT
8501 ATTGAGATGG AGTCTTACAC TGTCACCCAG GCTGGAGTGC AGTTGCGCGG 
8551 TCTCAGCTAC TGCAAGCTCC GCCTCCCGGG TTCATGCCAT TCTCCTGCCT 
8601 CAGCCTCCCG CGTAGCTGGA CTACAGGCGC CCGCCACCAC GCCTGGCTAA 

8651 TTTTTTGTGT GTGTTTTTAG TAGAGGCGGG GTTTCACCAT GTTAGCCAGG
8701 ATGGTCTCGA TTCCTGACCT CGTGATCCAC CCACATCGGC CTCCCAAAGT 
8751 GCTGGGATTA CAGGTGTGAG CCACCATGCC TGGTCTAGAA ATTATTTTAT 
8801 ATTTTATACC ATTGCCTTAT AAGTTCTCAA GCAACTGGAA AATACAATCA 
8851 GAACGTATTC CTCAAGATTT CAAGGATATT TTACACAAAG TTCTATTGTC 
8901 TGATTCCTTA GCAGTTGTTA CTACTGTTTC CCTAACCTCT AATCTTCTAT 
8951 TGGGTTATTA GTCTTAGAAT TGAATTTTGA GAGGTAAGGG CTTGAATTTG 
9001 AACATAGAAA TTTATACAGG TCTGATCAGT AGTTCTTGAC ATTGTATTAT 
9051 CTGGAAACAA ATCTTTAGAA CTGAGCTTAA GATGTTTAAT GACATTTTGT 
9101 AGACAGAGTA TGATTTCAGT GTAGTTGTTT TTGTTTCTTT CTAGATCTAG 
9151 TTCAGAGATG AAGTATATCA ACTTTTTTTT CCTTTTTGAC CCAATGCTAG 
9201 CAGAAAAACA ACACCTTTTA ATCATATTTA GTATTTGAAA ATGTGTATAC 
9251 AGGTTCCTTT TTATTTTATT TATTTCTTAC AGGTTCCTTT TTAATCAGCT 
9301 TTATTGAGAT AGAGTTCATA TACTGTATGG TTCATACCAC ATATGGTTCA
9351 TATACCATAC AGTATATGAA CTCACTTTAA GAGTATAATT CAGTGGGTTT
9401 AAGGGTATAA TTCATTCATT TTAAGGGTAT AATTCAGTGG CTTTTAGTAT
ATTTTCTTTT TTTTCTTCTT TTTTTCTTTT TTCGAGACAG GGTCTTGCTC
TGTTACCCAG GCTGGAGTGC AGTGACGCAG CCTCAGCTCA CTGCAACCTC 
CACCTCCCAG GTTCAAGCGA ATTCTCTTGC CTCAGCATCC TGAGTAGCTG 
GAACTACAGG CTCACGCCAC CACACCCAGC TAATTTTTAT ATTTTCAGTA 
GAAACAGGGT TTCACCATGT TGCCCAGGCT GGTCTCGAAC TCCTGGCCTC 
AAGTGATCTG CCTGCCTCAG CCTCCCAAAG TGCTGGGATT ACAGGCATGA 
GCCACCGTGC CTGCCTGTTT TGTAGTGTAT TCAAACAGAG TTGTACAACT 
GTCACCACAA TCAGTTTTAG AACCCCAAAA AGAAACCCTG TACTCTTTAC 
CAGTCACTCC CTATCTTCCG TCCACTAACC CCTGGCATCC ACTAATTTAC 
ATGACCTCTA TGAATTTGCC TATTCTGAAC ATTTTATAAA TGGAATTCTA
AATACACTAC CTTTTATATC TGGATGCTTT TACTAAGCAT GTGTATTTTT 
GAAATTGACT TTAAAGCTTG TTGGCCCCTG GAAGAGTAAA TTACTCTCCA 
CCCCAAGTAT TCCCTCTACC CCTCAGCTTT GCCTGTAAGT TTCTTTTTAA 
AAAAATCACA CATACATTGT TGTAGTAGAT TTAAGAATAA GTATTTTGCT 
GACCCAAGGT TCTTTTGCTT CTTTCTAGAT CAGTGCCTTG CAGGTTTTAT 
TTTACAGAGC TTAATAGAAT CAGAAATCTC TTTAAAACTC CAGTCTCATA 
TCCAGTTATC ACTCACCATC TCTGTGTTTG CAGCAATAGC CAGGCCTGGC 
CCAGAGGGAC TTGATCTCCA CTTTTGGTTT TTAGACTTTT CTGTGGCTTT 
TACCACCTGC TGTGTATCCT TGACCTCATA CTGCTGGACT CCTTTGGATG 
GATACCAGCA GGATGGTTCA GGCTCCAGTG GGCACTTTTT AAAATTCTCT 
CCTTCTGTTC AGATAGACAG AGCTCAGGCA GATCACCAAG TCTGTTGCCT 
GTGTAACCAG GAAAGAGATG CTAGTTTTCT TTTAGGCACT CCCATTTGTT 
TCTGTTGGAA CCTTCCTCAC TTAGTTGATG GAAGGGAAGC AAAAGACCCA 
GAACTCCATC TCAAATTAAT GACTTAACAA TTCTTGAATT TTCTCTTATC 
TCCTAGTTAA CTCTTTTTCT TATCTCCAGG AGGTCAGTTT TAATTATTGT 
TGTTTATTCA TTTATTTTCA TGGAGACAGG GTCTTACTGT GTTGTCCAGG 
CTGGCCTTGA ACCCCTGGCC TCAAGGAATC TTTTCACCTC AGCCTCCCAA 
AGTGCTGGAA TTACAGGCAT GAGTCACCAC ACCCAGCCTG ATATTTTTCA 
GTTGATGTAT CATAGTTGTG CCTAAGCATA ATTTTTTAAT TTTAATTTTT 
TATTTTTGGG GACAGGGTCT CCCTCTGTCG CCCAGGCTGG AGTGCAGTGA 
TGCGATCTCA GCTCACTGCA ACCTCCACCT TCTGGGTTAA AGCGATTCTC 
CTGCCTCAGC CTCCCGAGTA GCTGGGACTA CAGGTACCCA CCATCACACC 
CGGCTAAATT TTTTGTGTGT ATTTTAGTAG AGACGAGATT TCGCTGTGTT 
GCCCAGGGCG GTCTCGAACT CCTGAGCTCA GGCAATCCGC CTGCCTCGGC 
CTCCCAAAGT GCTGGGATTA CAGGCATGAA CCACCACGCC CGGCCAAGCG 
TAATATTTTT AAGGGTCATC AATGTTGTGT CATGAATCAA TCAGTGTTTC
GTTCTTTTTT ATGGTTGAAT AATATTCCAT GGTATGGATT TGTCACATTT 
TGTTTATCCA TTCATTAGTT GATAGACATT TTGGATTTCC ACTTTTTTTT 
TTTTTTTTTT GCTATTATAA ATAGTGATAC TATGTACAAA TTTTTGTGTG 
GAAATATGTC CTCATATCTC TTGGTTATAT ACCAAAGAGT GGAAGTGCTG 
GGTCATATGG TAACTACGTG TTTAACATTT TGAGAAACTG CTAAACTGTT 
TTCCAAAGTT GCTGTACCGT CGTACATTCC TGCCAGCAAT ATATGAGGAT 
GCCAGTTCCT TCACATGTTC ACTACACTTA TCCACCTTTT TTATAATAAC 
TAATGGTGGG TGTGAGATGG TATCTCATTG TAGTTTTGAT TTGTATTTCT 
CTGATGGCTA AATGGCTAAT GATGTTTGAA CTTTTTGTTT GAGACAGAAT 
CTCACTCTGT CCAGATTCAA GCGATTCTCC TGCCTCAGCC TCCCTAGCAG 
CTGGGATTAC AGGCACATGC CACCACACCC AGCTAATTTT TTGTATTTCT 
AGTAGAGACA GGGCATTACC ATGTTGTTCA GGCTGGTGTC GAACTCCTGA 
CCTCAAAGGA TCCGCCTCCC TGGGCCTCCC AAAGTGCTGG ATTACAGGCT 
AGAGCCACCA TGCCAGGCCT TATGTTTGAA CATCTTTTAT GTGCTTATTG 
GACATTTGTG TATCTTCTTT GGAGAAATGT CTGTTCAAAG TCTTTGTCCA 
TTTTTAATTG GATTGTCTTT TTGTCTTTTG ATGTGTAAGA GTTCTTTATG 
TGTTTTGGAT ACAAGTTTGT TAGATATATG ATTTGCAAAT CTTTTCTCCA 
ATTTTTGTGG ACTTTTGCTT TCTTTTTTTG TTTTGTTTTT GTTGTTGTTG 
TTGTTGTTGT TGTTTTGGTC GGGGGACAGT CTTGCTCTGA CCACCCAGGC 
TGGAATGGAG TGGCGCGATC TTGGTTCACT GCAACCTCTG CCTCCTGAGT 
TCAAGCTATC CTGCTTCAGC CTCCCGAGTA GCTGGGACCC AGGTGTGTGC 
CACCACTCCC AGCTAATTTT TTATTTTTAG TAGAGACCGG GTTTCACCAT 
GTTGGCCAGG CTGGTCTTGA ACTACTGACC TCAGGTGATC TGCCTGCCTC 
AGCCTCCCAA AGTGCTGGGA TTACAGTCAT GAGCCACTAC ACCCTGATTC 
TTTTTGCTGG CTTTCTTTCT TTTTTTTTCT TTTTTTTTTT GAGACGGAGT 
CTCGCTCTGT TGCCAGGCTG GAGTGCAGTG GCATGATCTC GGCTCACTGC 
AACCTCTGCC TCCCGGGTTC AAGCCATTCT CCTGCCTCAG CCTCCCGAGT 
12601 AGCTAGGACT ATAGGCACAT GCCACCATGC GCAGCTAATT TTTGTATTTT

12651 TAGTAGAGAC GGGGTTTCAC CATGTTGGCC AGGATAGTCT CGATCTCTTG

12701 ACCTCGTGAT CCGTCCGCCT GGGCCTCCCA AAGTGCTAGG ATTACAGACG 

12751 TGAGCCACCA CACTCAGCCT CTTTTTGCTT TCTTGATGGT GTCTTTTGAA 

12801 ACAAAAGTTT TTACTTTTGA TAAAGTCCAA TTTGTCTATT TTGTTTGTTT 

12851 GTTTTTGTTA AGAAGCTTTG CCTAACCCAA AGTCACGAGA ATTTTCTCTT 

12901 AGGTTTTCTT CTAAGAGTTT TATAGTTTTA GCTGTTTCTG TGATCCATTT 

12951 TGAGTGAATT TTTGTGAATG GTATGAGGGA GTGATCCAAC TTCATTCTTT 

13001 TGTGTGTGGA TATCAAGTTG TCCCAGCACT ATTTGTTTAA ACCACTGTTC 

13051 TTTTCCCCCA TTGAATTATC TTGGCATCAT TGTCAGAGAT AAATTGACCG 

13101 TAAATGTGAG GGTTTTATTT CTGAACTCTC AAGTCCATTT CATTGGTCTA 

13151 CATGTCCCTA TGCCAGTAAT ACACTATCTT GGTTACTGTA GCTTTTTAGT 

13201 ACGTTTTGAA ATGTTTTTAA AATTTGTTTT TCATCTAAAT TTTAGGATTA 

13251 ATTTGTCAAT TTCTGCACAA AAGGCACCTG GGTTTCTATA GGGGTTATGC 

13301 AGAATCTGTA GATCAACTGG GGGAGTATTA CAGGCATGAG CCACCGTGCC 

13351 TGGCTGACTG AGTTTTTCAT AGATGTACTC TATCAGGTTT AGGAAGTTCC 

13401 CTTTTATTCC TAGGTTGTTG AGTCTATTTT ATATTACTTT TTTAGAGACA 

13451 GTCTTGCTCT GTCCCTCAGG CTGTAGCACA GTGGCTCAAT CATAGCTCAC 

13501 TGCAGCCTTG AACTCCTAGG TTCAAGAGAT CCGCCTGCCT CAGCCTTCTT 

13551 AGTAGCTGGG ATTACATGCA TGCACCACCA TACTGGGCTA ATTTTTTAAA 

13601 ATTTTTTATA GAGACAGGGT CTTATTACTA TGTTGCCCAG ACTGGCATTG 

13651 AGTCTTTTTA TCATTAATGA GCACTGAATT TTGTCAAGTG CCTTTATAAT 

13701 ACCTATTGTG ATGATCATAG GGTTTTGTTC TTTAGTCTAC CGATACGCTA 

13751 TATTGCATTA AGTGATTTTT TTGAATGTTA AACCAACCTT GCATTTTTTT 

13801 GGTGTATAAG TCTTATTTGA TCAATGTGTA TTATCCTTTT ATATGGTGCT 

13851 GGATTTAGTT TGCTACTATT TTGTTGAGGA TTTTTGTGTC TATATTCATA 

13901 AGAGATATTG GTCTGTAGTT TCTTGTGATG TCTTTGTCTG GTTTTAGAAT

13951 CAGGGTAATG CTGGCCTCAT AGAATGAATT GGGAAGTGTT GTCTTTTCTA 

14001 TGTGATGGGA GAGTTTGTGA ATCATTGGTA TTAATTTTTC TGTAAATGTT

14051 TGGTAGAATT CACAAATAAA GGCATCTGAG CCTGGGCTCT TCTTTGTGGG 

14101 AAGTTTTTGG CTTTTTTTTT CTTTAAAAAT TTTCATTGTG GCTGGGCATG 

14151 GTGGCTCACG CCTGTAATCC CAGCACTTTG GGGGGCCAAG ACGGGTGAAT 

14201 CACCTGAGGN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

14251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

14301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

14351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

14401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

14451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNGAG

14501 TGCAATGATT GCAGTCTTGG CTCACTGCAA CCTCTGCCAC CTGGGCTCAA 

14551 GAGATTCTCC TGCCTCAGCC TCCTGAGTAG CTGGGATTAC AGGCATGCGC

14601 CACCATGCCC AGTTAATTTT TGTATTTTTA GTAGAGACGG GGATTCTCCA

14651 TGTTGACCAG GCTGGTCTCT AACTCCTCAC CTCAAGTGAT CCGCCCGCCT

14701 CGGCCTCCCA AAGTGCTGGG ATTACAGGCA TGAGCCACCA CGCCCGGCCT 

14751 TAAAAATTTT TTTAATGTAC AGTTGAGTAG TATTTAATAC ATTCACATTG

14801 TTGTGTACCC AGTTTCCAGA ACTCTTCATC CTACAGAACT GAAACTCCAT 

14851 ACCCATTAAA TGAGTCCCCA TTCTCTTTCC CCCAGCTCAT GGCAAACAGC 

14901 ATTCTATTTT CAGTCTCTAT GAATTTGATT AGTTTAGATA CTTCATACTG

14951 TAAGTGGAAT CATATGGTAT TTGTCTTTTA GTGACTGCCT AATTTAAAAA

15001 AAATTTTTTT GAGACGGAGT CCTGCTCTGT CGCCCAGGCT GGAGTGCAGT 

15051 GGCACCATCT CTGCTCACTG CAACCTCCAC CTCCCAGGTT CAAGTGATTG 

15101 TCCTGCCTCG GCCTCCCACG TAGCTGGGAT TACAGGTGCT CGCCACAACA 

15151 CCCGGCTAAT TTTTGTATTT TTAGGTAGAG ACTGGGTTTC ACCATGTTGG 

15201 CCAGGCTGGT CTCGAACTCC TGACCTCAAA TTATCCACCT GCCTTGGCCT 

15251 CCCAAAGTGC TGGGATTACA GGCGTGAGCC ACTGTGCCCA GCCTCCATGT 

15301 TGTTTTTCAC AACACCTGTA TCATTTACAT TTCCACCAAC AGTACACAAG 

15351 AATTTCAGTT TCTCCACATC CTTGCTAGCA GTTGTTATTA TCTGTTTTTT 

15401 TTTAATGGTT TCTTTTTTCC TTTTTCTTTT TTTTTTTTTT TGAGACGGTC 

15451 TTATTCTTGC TCATGCTGGA GTGCAGTGGT GCAATGTGAT AGCTCACTGC 

15501 AGCCTCAACC TCTGGGCTCA AGCAGTCCTG CCACCTCAGC CTCCACATAG 

15551 GTGGGACTGC AGGTGTGCAC CACCACTTGT GGCTAATTTA AAAAATTTTT 

15601 TCGTAGAGAC AGAGTCTCAC AGTGTTACCC AGGCTGGTCT TGAACTTCTG 

15651 AGCACAAGTG ATCCTCCCAC CTCAGCCTCC CAAAATAATG AGATTAGAGA 

15701 CATGAGCCAA CATGCCCAAC CAGTTTTGTT TGTTTGTTTT GTTTTGTGTT 
TTTGAGACAG AGTCTCACTC TATTGCCCAG GCTGGAGTGC AACGGCATGA
TCTTAGCTCA CTGCAACCTC CGCCTCCCAG GTTCAAGTGA TTCTCATGCC 
CCAGCCTCCT GAGAAGCTGG GATTACAGGT GTACCACCAC ACCCAGTTAT 
TTTTGTATTT TTAGTAGACA TGGGGTTTTG CCATGTTGGC CAGGCTGGTC 
CCGAACTCCT GACCTCAAGT GATCTGCTCC CCTCAGCCTC CCAAGGTGCT 
AGAATTAAGT TTTTCTTTCT TTCTTTCTTT CTTTTTTTTT TTTTTTTTGA 
GACAGAGTCT CACTCTGTCA CCCAGGCAGG AGTGCAATGG CACGGTCTTG 
GCTCATTGTA ACCTCTGCCT CCCAGATTCA AGTAGTGATT CTCCTGTCTC 
AGCCTCCCAA GTAGCTGGGA TTACAGGCAT GCACCACCAC GCCCAGCTAA 
TTTTTTGTAT TTTTAGTAGA AACGGGGTTT CACCATGTTG GTCAGGCTGA 
TCTCAAACTC CTGACCCCAA GTGATCCACC CGCCTTGGCC TCCCAAAGTG 
TTGGGATTAC AGGCGTGAGC CACTGTGCCT GGTTTTATTT TTATTATTAT
TATTTTTAAT AGTTCCTATT CTAATGGGTA TGAGGTAGTG AGGTGGGTGG 
TTGTGGTGTT TTTATGAATG TTTAATTGGA AATGGGTGGC CATTGTGTGC 
AGGAAAAACC TCCTAAATTG TGTCAAACTC CTGGAAAATG AAATATCATT 
CCAGTTGCAA GAATATCTTT TTTTTTTTTT TTTTTTTTTA AGACAGAGTC
TCACTCTGTC ACCAGGCGGG AGTGCAGTGG CACGATCTCG GCTCACTGCA 
ACCTCCGCCT CCTGGTCCGC CTCCCGGGTT CAAGTGATTC TTCTGCCTTA 
GCCTCCCAAG TAGCTGGGAC TACAGGCGCG TGCCACCACT CCTGGCTAAT 
TTTTTTGTAT TTTTAGTAGA GATGGGGTTT CACCATGTTG GCCAGGATGG 
TCTCAATCTC TTGACCTAGA GATCCGCCTG CCTCGGCCTC CCAAAGTGGT 
GGGATTACAG GGGGGTCACC GTGCCCAGCC ACAAGAAGAT CTTGAGCATG 
TGAATGATCA GAAATGATTT AGCCTATGTA GGCACTAGGC CAGGTAGTGA 
AATTCAGGGA AAATAATTCA GATGCTTCTG AGCTATCACT TATGAACTAA 
GAAACAGCTT AAAGCCATTA TAGTGTGTTT CCTGAAGATG AAAGCATATG 
GTAAGATGAA ATAGTGATTA TTTTTTAAAA ATTACTACTC CAGAAAGGAA 
AAGTTTACTA ATTTTTATTA CTAAAGTTTA CTGTTGGTGG GTGCGGTGGC 
TCACACCTGT AATCCCAGCA CTTTGGGAGG CCGAGGCAGG CGGATCACCT 
GAGGTCAGGA GTTCGAGACC AGCCTGACCA ATATGGTGAA ACCCCATCTC 
TGCTAAAAAT AAAAAATTAG GCCGGGCGCG GTGGCTCATG CCTGTAATCC 
CAGCACTTTG GGAGGCCGAG GCAGGTGGAT CACGAGGTCA GGAGATTGAG 
ACCATCCTGG CTAACACGGT GAAACCCCGT CTCTGCTAAA AATACAAAAA 
TCAGCCGGGC GTCTTGGCAG GCACCTGTAG TCCCAGGTAC TCAGGAGTTT 
TGAGACGGGA GAATGGCGTG AACCCGGAAG GCGGAGCTTG AAGTGAGCCG 
AGATTGCGCC GCTTCAGTCC AGCCTGGACG ACAGAGTGAG ACTCTGTCTC 
TAAAAAAATA AATAAATAAA AATAAAAAAT TAGCTGGGTG TGGTGGCACG 
CACCTGTACT CCCAGCTACT CGGGAGGCTG AGGCAGGAGA ATTGCTTGAA 
CCCGGGAGAT GGAGGTTGCG GGGAGCCAAG ATTGCGCCAC TGCACTCCAG 
CCTGGCGACA GAGTGAGACT CTTTCTCAGA AAAAAATATG ATAATTAAAA 
GTTGAGACGT TCTTCGCCGA GAGTGGTCGG GGTTTCCTGC TTCAACAGTG 
CTTGGACGGA ACCCGGCGCT CGTCCTGCAC CCCGGCCGGC CGCCCATAGC 
CAGCCCTCCG TCACCTCTTC ACCATGCCCT CGGACTGCCC CAAGGCCCCC 
GCCGCAGCTC CAGCGCCGCG TAGCCACCAC TGCCGCTGCC GCCGCCTCTC 
CTTAGTCGCC GGCATGACGA CCGCGTCTAC CTCGCAGGTG CGCCAGAACT 
ACCACCAAGA CTCAGAGGCC GCCATCAACC GCCAGATCAA CCTGGAGCTC 
TACGCCTCCT CCATTTACCT GTGCGTGGCT TACTACTTTG ACAGCGATGA 
TGTGGCTTTG AAGAACTTTG CCAAATACTT TCTTCACCAA TCTCATGAGG 
AGAGGGAACA TGCTGAGAAA TTGATGAAGC TGTAGAACCA ACGAGGTGGC 
CGAATCTTCC TTCAGGATAT CAAGAAACCA GACTGTGCGG GGAGAATGCG 
ATGGGAGAGC GGGCTGAATG CGATGGATTA CATTTGGAAA AAATTGTGCA 
TTTTGCATTA CATTTGGAAA AAAATGTGAA TCAGTCACTA CTGGAACTGC 
ACAAACTGGC CACTGACAAA AATGACCCCC ATTTGTGTGA CTTCATTGAG 
ACACATTACC TGAATGAACA AGTGAAGGCC ATCAAAGAAT TGGGTGACCA 
CGTGACCAAC ATGCACGAGA TGGGAGCGCC CGAATCTGGC GTGGCAGAAT 
ACCTCTTTGA CAAGCACACC CTGGGAGACA GTGATAATGA AAGCTAAGCC 
TCAGGCTAAT TTCCCCATAG CCATAGGGTG ACTTACCTTG TCACCAAGGC 
AGCGCATGTA TGTTGGGGTT TCCTTTACCT TTTCTATAAG TTGTTCCAAG 
ACACCCACTT AAGTTCTTTG ATTTGTACCA TTCCTTCAAA TAAATAAATT 
TGGTACCCTC CCCCCCCCCA AAAAAAAAAT GTACTGTGGG CTGGCGTAGT 
GGCTCATGCC TAAATCTCAG CACTTTGGGA GGCTGAGGCG GGAGGATCAC 
CTGAGGTCGG GAGTTTGAGA CCAGCCTGGG CAACATGGTG AAACCCCGTG 
TCTACTAAAA ATATAAAAAC TAGCCAGTCA TGGTGGCACA CACCTGTAAT 
CCCGGCTACT TGGGAGGCTG AGGCATGAGA ATCACTTGAA CCTGGGCTGC 
GGAGGTTGTA GTAAGCTGAG ATCATGCCAC TGTACTCCAG CCTGGGTGAC
AGGGAGACAT TCTCTCTCTC AAAAAAAAAA AAAAAACAAA AAAAACAAAA 
CAAACCAACA AAACAAAGTA ATCCAGGAAC AACAACATGA TGAAGGACTG 
CATGCAGGAC TCAGTGATGG ATGGTGGAAG ACAGCCAGGA AGTTAAGCAT 
GACTCTGGTA TTAAGTGTTG TCTGGGAGAG TTAAGATTCC ATTTACAGAA 
ATAAGACCTG TAGGGGAAGC TCTTGATTTT TTTTTTTTTG CAGACTGCTG 
ATTTCCTGAT TACATGTGTT AAGTTTGAGG TATAGAGAGA AAGAACATCC 
TGGCCGGGTG CAGTGGCTCA CACCCGTAAT CCCAGCACTT TGGGAGGCCA 
AGGTGGGCAG ATCACGAGGT CCAGGAGATC GAGACCATCC TGGCCAACAT 
GGTGAAACCC CGTCTCTACT AAAAATACAA AAATTAGCTG GGCGTGGCGG 
CGCGTGCCTG TTATCCCAGC TACTCAGGAG GCTGAGGCAG GAGAATTGCT 
TGAACCCGAG AGGCAGAGGT TGTGATGAGC CGAGATCGCG CCACTGCACT 
CCACCCTGGC AACAGAGCTA GACTCTGTCT CAAAAAAAAA AAAAAGAAAG 
AAAAAAAGAA CATCCTGTAG AAACAGGCAG TCAGAGGTAT AGAACTACAC 
AGAATCCAAG AGATCTTTCA AGAAAAGTGA CATGCAGCAA GAGAAACTAT 
CAAGGGGGTA AACAACCTAT AGAATGGGAG AAAATATTCA CAAAGTATAC 
ATCCAACAAA AGTCTAATAT CCAAAATCTA TAAGGAACCT AACAAGCAAA
AAGCAAATAA CCCCCTTAAA AAGTGGGCAA AGGACATGAA CAGATACTTC 
TCAAAAGACG TACATGTGGC CCACAAACAT GAAAAAACGC CCATTTCTAA 
TCATCAGACA AATGCAAATT AGAACCACAA AGAGATACCA TCTCACACCA
GTCAGAACAG CTTTTGTTAA AATGTCAAAA AATGAGAAAC GTTGGTGAGG 
CTGCAGAGGA AAGCAGACAC TTGTACACTG TTGGTGAAGG TGTAAATTAA
TTTAGCGTAG GCACAGTCAG TTTGGAGATT TCTCAGAGAA CTAAGAGTGG 
AACTACCATT AGACCCAGCA ATCTGATGGC TGGATATACG GCCAAAGGAA 
AATAAATCAT TCTGCCAAAA GAACATATGT ACCTGTATGT TCATTGTGGC 
ACCATTCACA ATAGCAAAAA CATTGAATCA ACTCATGTGC CCATCAGTGG 
CGGACTAGAA AAGAAAAGAA AATATGGTAC ATAGCCATCA TGGAATACTA 
TGCACCCATT AAAAATAATG AAATAATGTC TTTGCAACAA CATGAATGTA 
GCTGGAGGGC ATTATCCTAA GCAAACTAAC ACAGAAACAG AAAACCAAAT 
ACTGCGTGTT CTCACGCAGT GAGAGTGGGA GCTGAACATC AAGTACACAT 
GGATGTAAAG ATGGCAACAA TAGACATGGG TCTACTAGAG GTGGTGGTGC 
GGCAGGGTGG GGGTGGGGGT TGTGTGGCAG AGGAACAGCT GAAAAACTAC 
CTATTTGATA CTATACCCAG CACCTGGGAA ACGGGTTCAG TCATACCCCA 
AACCTCAGCA TCACACAGTA TACCTTTCTA ACAAACTTAC ACATGTATTC 
TGTGATTCTA AAATAAACAT TGAAAAATAA AAAAAAAACT GACATGGTTT 
GTACTGTTTA ATCTGACATA ATGGCTAGGG GAAATGAAGT CTGCAGAATG 
GCTGTTTACG GATGTTGTTG TTGTTGTTGA GATGAGGTCT CACTATGTTG 
CCCAGGCTTG AACTCCTGGC CTCAAGCAGT CCTCCTGCCT TGACCTCCCA 
AAATGTTGAG ATTACATGCA TGAGCCATTG CCAAAACGGC TATTTGGATT 
GCTGTTAAGG TTATTACATT CTCTGTGTAG TAAGACCTTG AAGGAGAAGG 
ATTTGAGATC AGGAGTTTAA GAAAAAATGT TAATCTAGGA AGAGAGGATA 
ATTTCTGTGT TTGGCCAGTT GCAATGGCTC ACGCCTGTAA TCCCAGCGTT 
TTGGGAGGCC GAGCTGGGCA GATCACTTGA GCTCAGGAGT TTGAGAAGAG 
CCTGGGCAAC ATGGTGAAGC CCCGTCTCTA CTGAAAATAC AAAAATTGGC 
TGGGCGTGGT GGCAGGTGCC CACTGTAATT TCAGCCACTC AGGAGACTGA 
GGCAGGAGAA TTGCTTGAAC CTGGAAAGTG GAGGATATGG TGAGCCTAGA 
TCGCGCCACT CTACTCCAGC CTGGGAGACT CCATCTCAAA AAAAAAAGAA 
AACAAGATGC TGAAATGAAG TAATTACCAC AGTCAATGTG ATCCTATAAC 
TTTGTTTTCT TTTAGAGATG GGGTCTCCCT CTGTCACCCA GGCTGGAGTG 
CAGTGGTGCA TCATAGCTTT CTGCAGCCTC CACCTCCTGG GCTCAGGTGG 
TCCTCTTGCC TCAGTGTTCC GAGTAGTTAG GACTGACTGC AGGTGCATGC 
TGCTATGCCT GGCTAACTTT AAAATTTTTT TGTAGAGGCG GGGTCTTGCT 
ATGTTGCCTA GGCTGGTCTC CAACTCCTGA TCTCAATCAG TCCTCCTGCC 
TACCTTCCCA GAGCGCTGGG ATTACAGGTG TGAGCCATCG CACCTAGCCA 
ATCTCATAAC ATTTTATGAC TAGCAAACTT AGTAGTTCTG ATTCAGGCAT
AAATCAGTTG GTGGGGTTAT ACAAGGTTGG GTGAGTTTTT CTAGATTTCT 
AAGAGACCAT GTTGAAATAC TTGGCTCTGT CTCAGTAAGG GATAGAGAGA
AGCAAAGGTG TGGGTAAAGG TTATGAGCAG ACATGTAAAG GGGGCAAATT 
AAAGTGTTTA GGGAAGGTGA AACAGTTCCA AATCATAATA TAGATCCCAA 
GCCTCACCAA GAAGTGAAGG AGAGGGTAAT TGTGCGATAG TTCTCAGGAC 
TGAGACCTCA AGGTGTAAGA CGAATCTTTA TTGTGGGTGG TCCTCTTTGA 
GAAAAAGAAC AAAAAAGAAA AATGTGAAAT GGGTGCTAAA TTTCAGAGCA 
GAATATACAT ATATGTGTAT GTGTGTGTGT ATATATATAT ATATTTTTAT 
GTAGCAAATT CCAGTTTACA AAGGGCTTTT ATGTGTTTTG CATCATTACA
ACAGTTCTGT GATGATGTGG ATGTGGTGGA TATTGTAATT CCACATCCCA 
GAGGATAAAA TTGAGGCACA CCATAAGGTA GCTGACCAGA GATCATGCAG 
TATATGGTAG AATGGCAACT TGAGGCCAGT TCTTAGGTGT GTTTGTTGTC 
CTTATTCAGA ATGGAACAAT GTGGTTTATT GTACAAAAAT TTAAAAATGA 
ATGTCGAAAA GTAGAAATTA TTACCCAAAT CTCACCCTCT GTGGTTGTTT 
GCTATGTGAT CTTCCAGACT CACATATACA TGTAGATAAT TTTTTTTTTT 
TTTTTTGAGA CGGAGTCTTG CTCTGTCGCC CTGGCNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNCAAAAAT TAGCTGGGCG TAGTGGTGGG CGCCTGTGGT CCCAGCTACT 
TGGGCAGCTG AGGCAGGAGA ATCGCTTGAA CCTGGGAGGT GGAGGTTGCA 
GTGAGCCAAG ACCATGCCAC TGCACTCCAG CCTGGGCAAC AGAGCAAGAC 
TCTTGTCTTA AAAAACAAAA AAGTGTACCC AGTTGAGCTG ATTCTTTATC 
25201 TTTTTTTCAC TGGAGAACTA AGTATACAGG TGAGAAAAGA CGAGATATTT

25251 ATACCCGAGA GAATTGATGG TGAAATCCAT TTTTTTGGAT CAGAACTTCC 

25301 CCAAACAGTG TCCTTCAAAT AGGGTTCAGG GGTGCTAAGA TATTTATCCC 

25351 CTCAACCCTT GGGGTTCACT CCAGTATGGC ATATAAATAT TGTATCACTT 

25401 TCTATGTGTG GGGAGCAGTG CTCCAGGTGA CCTTCCTTCC TTTCCTTCTA 

25451 GGGGAGGAAG TTGCCTTGTA CTGTGCCAAA TATCTTCCTG ATATCATCAA 

25501 AGATCAGAAG GCCTACAAGG AAGGCAAGCT ACAGAAGGTC TGTCTGCTTA 

25551 CACCGCCCAT TCCTCACTTG TGTAGGCTTT TCCCTTGTTC TCTAGCCCTT 

25601 GGGCTTTTCC TTTCTTTTTG TCCTCTAGCT GCTGCTGCTT ATTTACTCTT 

25651 GAAGAATTCT GTTCCTAAAA CGAGCTTATT GGCCGCCTTT TAGACTTGCC 

25701 TTATTATTCC TAGGCCTCTG AGCTGTTTTT ATCTGTGAGT GTCTCTTAGT 

25751 GTGGTGGCTC ACACTCTTAA TTTGTATTCC ATCCTTGTGC TCAGGATTGT 

25801 ATATAGGGAG TTCATTTTGT ACTAGTCTTA GACTATTTTG CTTATATTCA 

25851 GGCTTTAGAA GATGCCTTCT TGGCTATTGA CGCCAAATTG ACCACTGAAG 

25901 AAGTCATTAA AGAGCTGGCA CAGATTGCAG GGCGACCCAC TGAGGATGAA 

25951 GATGAAAAAG AAAAAGTAGC TGATGAAGAT GATGGTGAGT GTGGCATCCC 

26001 TTGTTTGAGG GGAAATCAGC ATTTTAAGAA ATATTCTTTA ATATTACTTA 

26051 TCAATTCTAA GATAGGATGG CTTTCTAGGG ACCTGGGGAG TCCTTATGTT 

26101 AAAGAAACCT ATGATGTTCT CCTGCATTGT ATGTGGTTAT GAAAAGGAGG 

26151 GAGAGAATTA TCTTCGTTGA GTGGCATCTG AGCTGTAAGC ATTGTATATA

26201 CATTATCTTT TGTCATTGTG ATGGGGTCTT CCTGGTTCCT GCTAGTATTT

26251 ATGTGCTTTT TTTTCCCCTC AAGACTGGAG CAGTTATTAG CCCCAATAGC 

26301 CAATCATTAA GCCTAAATCC TAATTCACAG TAGCATTGTG GGCTTCCTGG

26351 ATCCTCAGCC AGAATAGGGT TTTTACAACT TAACAATAAA AAATGAGACG 

26401 TCAGAGGGGA AGTATAGTAA CTAGTGTTGT TTTGATTAAG AAGGGGATGA 

26451 AACACAAAAA CCAAAAGAAG TCTGTGGAGG AGGAGGAGCT AGGGCATGTT

26501 CTTCTGAGAC TTGAGCGAGA GGAACCTTGG GAGTGGGAGG TTGTGGGGAA

26551 GTTAGAGGCT GCAAGGGCTG TTGAGGTAGT GAGAGGGACG GATCCCATGA

26601 GGAGTCTGGC ATGGGGGCTC TGATTTAGCC TCTTCCCTGC AGTGGACAAT

26651 GAGGAGGCTG CACTGCTGCA TGAAGAGGCT ACCATGACTA TTGAAGAGCT 

26701 GCTGACACGC TACGGGCAGA ACTGTCACAA GGGCCCTCCC CACAGCAAAT 

26751 CTGGAGGTGG GACAGGCGAG GAACCAGGGT CCCAGGGCCT CAATGGGGAG 

26801 GCAGGACCTG AGGACTCAAC TAGGGAAACT CCTTCACAAG AAAATGGCCC 

26851 CACAGCCAAG GCCTACACAG GCTTTTCCTC CAACTCGGAA CGTGGGACTG 

26901 AGGCAGGCCA AGTTGGTGAG CCTGGCATTC CCACTGGTGA GGCTGGGCCT

26951 TCCTGCTCTT CAGCCTCTGA CAAGCTGCCT CGAGTTGCTA AGTCCAAGTT 

27001 CTTTGAGGAC AGTGAGGATG AGTCAGATGA GGCGGAGGAA GAAGAGGAAG 

27051 ACAGTGAGGT AAGGGCCTGT GAGGGCAGGC AGATGCTGAA GTTGCAGAGA 

27101 GGTCCTGTTT GGTTGCCGTC TGTAGTTTTC AACTCTCTTT CCTTCTCCTA 

27151 TTTTGACATC ATCCCCCAAG ACCCACTGTA TTCTAAGCTT TAGTCTTGAA 

27201 TTCATTGAGC TCCATCATCA CAGGTACCAT TTGCCTTTTT ACCTCTTCCT

27251 TTGTTGGTAC TATAACAAGC AGATCTAGTT CTGGCTTTTC AGAGTCTGTC 

27301 TCCTAGAGAG AGAACAAGGA GATAGTTGTT ACCTTGGCTA GTTGACTGTT 

27351 TTCTTCTCTG GAAAATTTAT TTTCTGGCCA CAGTGCCTGA AAGATATTTT 

27401 TGGCTGGCAG CCCTTGCCTT GTCCTGGGCT TTTTGCTAGT GACTGCTAAG 

27451 CCCAGTTCAG GATGTCAGTT GTACTCATGC TAGCCCTTTC CATCCCCCCA 

27501 ATTTTCATGA CCATATACTT GTATCTTTCA GTGTTTTGAG GACCTGTGTT 

27551 CAGTCAGGAC CTCTTGATTC TGAGTATGAG CTGTGGGGAG GGAGGGGATC 

27601 ATCCCAGTCT CAGCAGTCTG GGATCCTCCC CCTGGCAGGA ATGCAGCGAG 

27651 GAAGAGGATG GCTACAGCAG TGAGGAGGCA GAGAATGAGG AAGATGAGGA

27701 TGACACCGAG GAGGCTGAAG AGGACGATGA AGAAGAAGAA GAAGAGATGA 

27751 TGGTGCCAGG GATGGAAGGC AAAGAGGAGG TGTGTGGGGA AGGGGAGCAA 

27801 TGAGTCTTGA AAAGCCACAA GGCAGGTGTG AATCCCCTAA TTTTGATTTT 

27851 GAGACAGGGG ATCCCCCTGA TACTTTAGGA TGGAAGTAAT AGTCATGGGG 

27901 ATTTATTCTG CAAGGGGAAT GAGATGGTAA GCCTTTGGGG TTGAATTATC

27951 TAAAAACAAG GGAGAGGGAG TGTGCTGCTG TCTCTAGAAA GATGAAATGT

28001 GTGCTTCTCC TGTTTGTTAA AGCTCTTTTG GGGGTCCCAG TGAAAGCAAG 

28051 CATAGGTGAA CGATCAGGAG CACATCAGTG AGGAACGCAT GTTCAGAAGC 

28101 CCCCATGATG CTCCTTTTCT TCCTCTTAAG CCTGGCTCTG ACAGTGGTAC 

28151 AACAGCGGTG GTGGCCCTGA TACGAGGGAA GCAGTTGATT GTAGCCAACG 

28201 CAGGAGACTC TCGCTGTGTG GTATCTGAGG CTGGCAAAGC TTTAGACATG 

28251 TCCTATGATC ACAAACCAGA GGATGAAGTA GAACTAGCAC GCATCAAGAA 

28301 TGCTGGTGGC AAGGTCACCA TGGATGGGCG AGTCAACGGG GGCCTCAACC 



FIGURE 3, page 9 of 17 



28351 TCTCCAGAGC CATTGGTAAG GGCCAAGAAA CTGGGAAAGA NNNNNNNNNN 

28401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

28451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

28501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

28551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

28601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

28651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

28701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNATGGTGAA ACCCCATCTC

28751 TACTAAAAAA AAAAAAAAAT ACAAAAATTA GCTGGGCATG GTAGTGCAAG 

28801 CCTGTAATCC CAGCTACTCA GGAGGCTGAG GCAGGAGAAC CGCTTGAATC 

28851 CGGGAGGCGG AGGTTGTAGT GAGCCGAGAT CGTGCCATTA CACTCCAGCC 

28901 TGGGCTACAA GAGTGAAACT CCGTCTCAAA AAAAAAAAAA CAAAAAAGAC

28951 TTAAAATAAA AAGACCAGTG AGTGACTTTC TTAAGGTTCA GCAGTCTGGT

29001 GGCAGGGTTG AAACTAGAAA AACTAGGACT TAGGACTCAG TTCCCCATTC 

29051 CACTAGATTA TGGAACTTTG TAAAGAAGGG AAATGAATGG CAAGGTTTGA

29101 CCTGCCACAA ACACAAGTCT GTGGGAAGTA TCCAAACTGC TCATCAACCA 

29151 TTCCTTTACT CCAGGGGACC ACTTCTATAA GAGAAACAAG AACCTGCCAC

29201 CTGAGGAACA GATGATTTCA GCCCTTCCTG ACATCAAGGT GCTGACTCTC 

29251 ACTGACGACC ATGAATTCAT GGTCATTGCC TGTGATGGCA TCTGGTGAGC

29301 ACTGGCAGAA TGCCCTAAAT TCCCCTTTCT GCAGCATGTC TTCTCTTATA 

29351 GGACTCAGGG CACCTCTAGG ATTAGAGCCT AGGCAGACCT AGGCCTCTTG

29401 GTGGGTGAAG AGCACCCAGA CTAAGGCAGA GCTGAGAATT TCTGTAGTTA

29451 TTTACACTGG CCTGGGCCAC CACCTCTGTC CATACTCCTC TACGCTGCCT

29501 TAGTGAGACT GGAAGATTCT GACTGTTGTT CTTGACCCCA GGAATGTGAT

29551 GAGCAGCCAG GAAGTTGTAG ATTTCATTCA ATCAAAGATC AGCCAGCGTG

29601 ATGAAAATGG GGAGCTTCGG TTATTGTCAT CCATTGTGGA AGAGGTGAGT

29651 ACCAGGGTGG AGAAGAGAGG GTGTCTGGTC TGCACAGCCA GGGTT (SEQ ID NO: 3) 
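
The numbered sequence lines above follow a regular layout: a leading position, then groups of ten bases. A minimal Python sketch for reading one such line is given here for illustration only; the function name `parse_numbered_line` is hypothetical, and rejoining digit groups that were split by whitespace is a heuristic assumption, not part of the listing.

```python
import re

# Tokens made only of base symbols (N marks unresolved positions in the listing).
BASES = re.compile(r"^[ACGTN]+$")


def parse_numbered_line(line):
    """Return (position, bases) for one listing line, or None if it holds no sequence.

    Assumption: leading all-digit tokens together form the position, so a
    position split by stray whitespace (for example "2 6151") is rejoined.
    """
    # Drop a trailing sequence identifier such as "(SEQ ID NO: 3)".
    line = re.sub(r"\(SEQ ID NO:.*?\)", "", line)
    tokens = line.split()
    digits = []
    while tokens and tokens[0].isdigit():
        digits.append(tokens.pop(0))
    # Reject lines without a position, without bases, or with non-base tokens.
    if not digits or not tokens or not all(BASES.match(t) for t in tokens):
        return None
    return int("".join(digits)), "".join(tokens)
```

A caller could compare each position plus the number of bases on its line against the next line's position to spot dropped or duplicated lines.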
20532 C T

21118 G T 

21211 C G 

21212 T C 
21238 - A 
26483 G A 
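
Each row above gives a genomic position followed by two allele symbols, with `-` apparently marking an absent base. A minimal Python sketch for reading such rows follows, as an illustration only. The names `Variant`, `parse_variant_row` and `is_indel` are hypothetical, and the sketch does not assume which of the two alleles is the major one, because the column headers are not present in this part of the listing.

```python
from typing import NamedTuple, Optional

ALLELE_SYMBOLS = {"A", "C", "G", "T", "-"}


class Variant(NamedTuple):
    position: int
    first: str   # first allele column as printed
    second: str  # second allele column as printed


def parse_variant_row(line: str) -> Optional[Variant]:
    """Parse a row such as '21238 - A'; return None for anything else."""
    parts = line.split()
    if len(parts) != 3 or not parts[0].isdigit():
        return None
    position, first, second = parts
    if first not in ALLELE_SYMBOLS or second not in ALLELE_SYMBOLS:
        return None
    return Variant(int(position), first, second)


def is_indel(variant: Variant) -> bool:
    """Treat a row as an insertion/deletion when either allele is '-' (assumption)."""
    return "-" in (variant.first, variant.second)
```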



Context : 



DNA 

Position 

2944 AGATGTGAATAACTTTTTAAATTTTAATATTTAAAATATCTGATGTGGGAAGCCTCTTTT
GGCTAGGAGTTTGACAGTGAAAGGAACCCCGGGCAGAGTCTGTTTCACATTTTGGTTGCC 
TGGCCTTGGGCTCTTGCATGTTAATTTCAGAGGCTGGACCCGACCTCCAGGAGTTGTCAC 
TCATTTGCACTCTTTTCAGGGCCTTTTACTAACTTCGGAAAACTTGAATTATGTCAGTCC 
CTAGGTTTTCCTTTTTATTATGCTTCGTTTTCTTTCTCTCTTAGGATTTCTCTAAAACTT 

ATCAGTAATTCTCATTTGCCTGTAATTGTAGTTTATAGTTTCTTTGATTGTAAGTCATTT 
CATCAATTTTTTTCATCACAACCTACCTACAAAGGGCTTTTCTAGAAAATTTTACTCTGG 
ACAAAAGGGGAAAAGAAAAATATTGGGGGGAAAGTAGTAGTATTAGGTAAAACTTGATGT 
GAAACTACAAAAGAGAAGAGGGAAAACTGCGGTAGGGAGGAAAGGGAGGAAGACGGGTTA 
ACCGTGGCTTTGTGAAGAGCATTCTGAAGTCTAGGCAAAAGGGCCAGGGAAATACTCTGT 

3232 TCTCTAAAACTTAATCAGTAATTCTCATTTGCCTGTAATTGTAGTTTATAGTTTCTTTGA 
TTGTAAGTCATTTCATCAATTTTTTTCATCACAACCTACCTACAAAGGGCTTTTCTAGAA 
AATTTTACTCTGGACAAAAGGGGAAAAGAAAAATATTGGGGGGAAAGTAGTAGTATTAGG 
TAAAACTTGATGTGAAACTACAAAAGAGAAGAGGGAAAACTGCGGTAGGGAGGAAAGGGA 
GGAAGACGGGTTAACCGTGGCTTTGTGAAGAGCATTCTGAAGTCTAGGCAAAAGGGCCAG 
[G, A] 

GAAATACTCTGTCTGGTATTGAGGGTTTCTCCACCTACCGGGTGGGCTTCAGGTAACAGC 
GAAATACTGTCTCCCTTGGGAATTGTTTCAGATCCCTCGCTCCTCCTGTGGTTAGCTCTG 
GAATGCCAGTATGAACTTCAATGTTTTGTTTTCCGATTCAAATTTTATATTCATAACTGA

CCTTAATAACAATTTTACAATTAGGTATAAAATTTCAGGATCCTAGTGTATCCTATAGTT 
CATCTCATCTGCTTTGGCTCCCTTTTTTTTTTTTTGAGACAGAATTTCGCTCTTGTTGCC 

7236 GACAGGATTTCACTATGTTGGCCAGGCTGGTCTCAAGACTCCTGACGTTGTGATCCACCT
GCCTCGGCCTCCCAAAGTGCTGGGATTACAGATGTGAGCTACCGTGCCCGGCCAGGAATT 
TTTTGTGCTATAAATCATATTTTCCTTTTATTAAAGGCAGTGTCAATATCTATAGTATAA 
TTTTGAGGAGGCTGGCTATTTATTGCTGTGTAGAAGCTGGCTTATTAGTGGTCAAGGGGT 
CATCTAGAATTGACTATAAAGATAGTATTGAGCAGAAAATTCTTAAAATACCTGCATATT 
[T,A] 

GTTTCAGTCATTAAATTAATGGAAAAAATATAAAAAGAAATATCACAAGTATGCTATGGG
GTTCTACCTTAGGGCATTGAAGGTTGAAAAACATTTATTTTCTTATCTTCAGATTAGCAT 
CTCATATCAATAGCCAATAGCTTAAAGCGCTTTTTACTTACTAAACCAGGTCAGAATTTC 
TCTCTCTCTCTCTTTTTTTTTTTTAGACAGAGTCTCTGTCACCCAGTCTAGAGTGCAGTG 
GTGTGATCTCGGCTCACTGGCCTTTCTGTTTAAGTGATTCTCCGGCCTCAGCCTCCTGAG 

8187 GTTGCCCAGGCTGGAGTGCAATGGTGCCATCTCGGCTCACTGCAGCTTCTGCCTCCTGGG 
TTCTAGCAATTTGCCTACCCCAGCCTCCCAAGTAGCTGGGATTACAGGCACTCGCCACCA 
TGCCCAGCTAACTTTTTTGTATTTTTAGTAGAGAAGGGGTTTTCCCATGCTGGCGAGTCT 
GGTCTTGAACTTCTGGCCTCAAGTGATCTGCCCGCCTCAGCCTCCCAAAGTGCTGGAATT 
ACAGGCATGAGCCACTGTGCCCGGCCTTTTTATTTTTTAAATTATTTATGTATTTATTTT 
[T,A,G] 

AGACAGGATCTCACTCTTGCCCATGCTTGAGTGGTAAGGAGTATGGGATTTCTTGTGCCG 
GTTCCTTCACATCCTCACTACACTTATCTGCCTTTACAGTGGCTCGATCATAGTTCACTG 
CATAGCCTTCTGGGCTCAAGGGGTCTTCCAGCCTCAGCCTAATATAGGCACATGCCACCA 
TGACTGGCTAATTTTTTTTTTTTAAGTTTTTTTTTGTAGAGATAGGGCCTTGCAGTGTTG 
CCCAGGCTGAGGAATTTTATTTATGTTTATTTTATTTATTTATTTATTTATTTATTTATT 

8187 GTTGCCCAGGCTGGAGTGCAATGGTGCCATCTCGGCTCACTGCAGCTTCTGCCTCCTGGG 
TTCTAGCAATTTGCCTACCCCAGCCTCCCAAGTAGCTGGGATTACAGGCACTCGCCACCA 
TGCCCAGCTAACTTTTTTGTATTTTTAGTAGAGAAGGGGTTTTCCCATGCTGGCGAGTCT 
GGTCTTGAACTTCTGGCCTCAAGTGATCTGCCCGCCTCAGCCTCCCAAAGTGCTGGAATT 
ACAGGCATGAGCCACTGTGCCCGGCCTTTTTATTTTTTAAATTATTTATGTATTTATTTT
[T,-,G] 

AGACAGGATCTCACTCTTGCCCATGCTTGAGTGGTAAGGAGTATGGGATTTCTTGTGCCG 
GTTCCTTCACATCCTCACTACACTTATCTGCCTTTACAGTGGCTCGATCATAGTTCACTG 
CATAGCCTTCTGGGCTCAAGGGGTCTTCCAGCCTCAGCCTAATATAGGCACATGCCACCA 
TGACTGGCTAATTTTTTTTTTTTAAGTTTTTTTTTGTAGAGATAGGGCCTTGCAGTGTTG 
CCCAGGCTGAGGAATTTTATTTATGTTTATTTTATTTATTTATTTATTTATTTATTTATT 

8187 GTTGCCCAGGCTGGAGTGCAATGGTGCCATCTCGGCTCACTGCAGCTTCTGCCTCCTGGG 
TTCTAGCAATTTGCCTACCCCAGCCTCCCAAGTAGCTGGGATTACAGGCACTCGCCACCA 
TGCCCAGCTAACTTTTTTGTATTTTTAGTAGAGAAGGGGTTTTCCCATGCTGGCGAGTCT 
GGTCTTGAACTTCTGGCCTCAAGTGATCTGCCCGCCTCAGCCTCCCAAAGTGCTGGAATT 
ACAGGCATGAGCCACTGTGCCCGGCCTTTTTATTTTTTAAATTATTTATGTATTTATTTT 
[-,A,G] 

AGACAGGATCTCACTCTTGCCCATGCTTGAGTGGTAAGGAGTATGGGATTTCTTGTGCCG 
GTTCCTTCACATCCTCACTACACTTATCTGCCTTTACAGTGGCTCGATCATAGTTCACTG 
CATAGCCTTCTGGGCTCAAGGGGTCTTCCAGCCTCAGCCTAATATAGGCACATGCCACCA 
TGACTGGCTAATTTTTTTTTTTTAAGTTTTTTTTTGTAGAGATAGGGCCTTGCAGTGTTG 
CCCAGGCTGAGGAATTTTATTTATGTTTATTTTATTTATTTATTTATTTATTTATTTATT 

9172 AAGGATATTTTACACAAAGTTCTATTGTCTGATTCCTTAGCAGTTGTTACTACTGTTTCC 
CTAACCTCTAATCTTCTATTGGGTTATTAGTCTTAGAATTGAATTTTGAGAGGTAAGGGC 
TTGAATTTGAACATAGAAATTTATACAGGTCTGATCAGTAGTTCTTGACATTGTATTATC 
TGGAAACAAATCTTTAGAACTGAGCTTAAGATGTTTAATGACATTTTGTAGACAGAGTAT 
GATTTCAGTGTAGTTGTTTTTGTTTCTTTCTAGATCTAGTTCAGAGATGAAGTATATCAA 
[C,A] 

TTTTTTTTCCTTTTTGACCCAATGCTAGCAGAAAAACAACACCTTTTAATCATATTTAGT
ATTTGAAAATGTGTATACAGGTTCCTTTTTATTTTATTTATTTCTTACAGGTTCCTTTTT 
AATCAGCTTTATTGAGATAGAGTTCATATACTGTATGGTTCATACCACATATGGTTCATA 
TACCATACAGTATATGAACTCACTTTAAGAGTATAATTCAGTGGGTTTAAGGGTATAATT
CATTCATTTTAAGGGTATAATTCAGTGGCTTTTAGTATATTTTCTTTTTTTTCTTCTTTT 

9173 AGGATATTTTACACAAAGTTCTATTGTCTGATTCCTTAGCAGTTGTTACTACTGTTTCCC 
TAACCTCTAATCTTCTATTGGGTTATTAGTCTTAGAATTGAATTTTGAGAGGTAAGGGCT 
TGAATTTGAACATAGAAATTTATACAGGTCTGATCAGTAGTTCTTGACATTGTATTATCT 
GGAAACAAATCTTTAGAACTGAGCTTAAGATGTTTAATGACATTTTGTAGACAGAGTATG 
ATTTCAGTGTAGTTGTTTTTGTTTCTTTCTAGATCTAGTTCAGAGATGAAGTATATCAAC 
[T,-] 

TTTTTTTCCTTTTTGACCCAATGCTAGCAGAAAAACAACACCTTTTAATCATATTTAGTA 
TTTGAAAATGTGTATACAGGTTCCTTTTTATTTTATTTATTTCTTACAGGTTCCTTTTTA 
ATCAGCTTTATTGAGATAGAGTTCATATACTGTATGGTTCATACCACATATGGTTCATAT
ACCATACAGTATATGAACTCACTTTAAGAGTATAATTCAGTGGGTTTAAGGGTATAATTC
ATTCATTTTAAGGGTATAATTCAGTGGCTTTTAGTATATTTTCTTTTTTTTCTTCTTTTT 

9180 TTTACACAAAGTTCTATTGTCTGATTCCTTAGCAGTTGTTACTACTGTTTCCCTAACCTC 
TAATCTTCTATTGGGTTATTAGTCTTAGAATTGAATTTTGAGAGGTAAGGGCTTGAATTT 
GAACATAGAAATTTATACAGGTCTGATCAGTAGTTCTTGACATTGTATTATCTGGAAACA 
AATCTTTAGAACTGAGCTTAAGATGTTTAATGACATTTTGTAGACAGAGTATGATTTCAG 
TGTAGTTGTTTTTGTTTCTTTCTAGATCTAGTTCAGAGATGAAGTATATCAACTTTTTTT 
[T,-] 

CCTTTTTGACCCAATGCTAGCAGAAAAACAACACCTTTTAATCATATTTAGTATTTGAAA 
ATGTGTATACAGGTTCCTTTTTATTTTATTTATTTCTTACAGGTTCCTTTTTAATCAGCT 
TTATTGAGATAGAGTTCATATACTGTATGGTTCATACCACATATGGTTCATATACCATAC 
AGTATATGAACTCACTTTAAGAGTATAATTCAGTGGGTTTAAGGGTATAATTCATTCATT 
TTAAGGGTATAATTCAGTGGCTTTTAGTATATTTTCTTTTTTTTCTTCTTTTTTTCTTTT 

13351 TTTTCCCCCATTGAATTATCTTGGCATCATTGTCAGAGATAAATTGACCGTAAATGTGAG 
GGTTTTATTTCTGAACTCTCAAGTCCATTTCATTGGTCTACATGTCCCTATGCCAGTAAT 
ACACTATCTTGGTTACTGTAGCTTTTTAGTACGTTTTGAAATGTTTTTAAAATTTGTTTT 
TCATCTAAATTTTAGGATTAATTTGTCAATTTCTGCACAAAAGGCACCTGGGTTTCTATA 
GGGGTTATGCAGAATCTGTAGATCAACTGGGGGAGTATTACAGGCATGAGCCACCGTGCC 
[T,C] 

GGCTGACTGAGTTTTTCATAGATGTACTCTATCAGGTTTAGGAAGTTCCCTTTTATTCCT 
AGGTTGTTGAGTCTATTTTATATTACTTTTTTAGAGACAGTCTTGCTCTGTCCCTCAGGC
TGTAGCACAGTGGCTCAATCATAGCTCACTGCAGCCTTGAACTCCTAGGTTCAAGAGATC 
CGCCTGCCTCAGCCTTCTTAGTAGCTGGGATTACATGCATGCACCACCATACTGGGCTAA 
TTTTTTAAAATTTTTTATAGAGACAGGGTCTTATTACTATGTTGCCCAGACTGGCATTGA 

15100 GTTGTGTACCCAGTTTCCAGAACTCTTCATCCTACAGAACTGAAACTCCATACCCATTAA 
ATGAGTCCCCATTCTCTTTCCCCCAGCTCATGGCAAACAGCATTCTATTTTCAGTCTCTA 
TGAATTTGATTAGTTTAGATACTTCATACTGTAAGTGGAATCATATGGTATTTGTCTTTT 
AGTGACTGCCTAATTTAAAAAAAATTTTTTTGAGACGGAGTCCTGCTCTGTCGCCCAGGC 
TGGAGTGCAGTGGCACCATCTCTGCTCACTGCAACCTCCACCTCCCAGGTTCAAGTGATT 
[G,C] 

TCCTGCCTCGGCCTCCCACGTAGCTGGGATTACAGGTGCTCGCCACAACACCCGGCTAAT 
TTTTGTATTTTTAGGTAGAGACTGGGTTTCACCATGTTGGCCAGGCTGGTCTCGAACTCC 
TGACCTCAAATTATCCACCTGCCTTGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCC 
ACTGTGCCCAGCCTCCATGTTGTTTTTCACAACACCTGTATCATTTACATTTCCACCAAC 
AGTACACAAGAATTTCAGTTTCTCCACATCCTTGCTAGCAGTTGTTATTATCTGTTTTTT 

15255 TGGAATCATATGGTATTTGTCTTTTAGTGACTGCCTAATTTAAAAAAAATTTTTTTGAGA 
CGGAGTCCTGCTCTGTCGCCCAGGCTGGAGTGCAGTGGCACCATCTCTGCTCACTGCAAC 
CTCCACCTCCCAGGTTCAAGTGATTGTCCTGCCTCGGCCTCCCACGTAGCTGGGATTACA 
GGTGCTCGCCACAACACCCGGCTAATTTTTGTATTTTTAGGTAGAGACTGGGTTTCACCA 
TGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCAAATTATCCACCTGCCTTGGCCTCCCA 
[A,C] 

AGTGCTGGGATTACAGGCGTGAGCCACTGTGCCCAGCCTCCATGTTGTTTTTCACAACAC 
CTGTATCATTTACATTTCCACCAACAGTACACAAGAATTTCAGTTTCTCCACATCCTTGC 
TAGCAGTTGTTATTATCTGTTTTTTTTTAATGGTTTCTTTTTTCCTTTTTCTTTTTTTTT 
TTTTTTGAGACGGTCTTATTCTTGCTCATGCTGGAGTGCAGTGGTGCAATGTGATAGCTC 
ACTGCAGCCTCAACCTCTGGGCTCAAGCAGTCCTGCCACCTCAGCCTCCACATAGGTGGG

18996 GTAGTGGCTCATGCCTAAATCTCAGCACTTTGGGAGGCTGAGGCGGGAGGATCACCTGAG
GTCGGGAGTTTGAGACCAGCCTGGGCAACATGGTGAAACCCCGTGTCTACTAAAAATATA 
AAAACTAGCCAGTCATGGTGGCACACACCTGTAATCCCGGCTACTTGGGAGGCTGAGGCA 
TGAGAATCACTTGAACCTGGGCTGCGGAGGTTGTAGTAAGCTGAGATCATGCCACTGTAC 

TCCAGCCTGGGTGACAGGGAGACATTCTCTCTCTCAAAAAAAAAAAAAAAACAAAAAAAA 
[A,C] 

AAAACAAACCAACAAAACAAAGTAATCCAGGAACAACAACATGATGAAGGACTGCATGCA
GGACTCAGTGATGGATGGTGGAAGACAGCCAGGAAGTTAAGCATGACTCTGGTATTAAGT 
GTTGTCTGGGAGAGTTAAGATTCCATTTACAGAAATAAGACCTGTAGGGGAAGCTCTTGA 
TTTTTTTTTTTTTGCAGACTGCTGATTTCCTGATTACATGTGTTAAGTTTGAGGTATAGA 
GAGAAAGAACATCCTGGCCGGGTGCAGTGGCTCACACCCGTAATCCCAGCACTTTGGGAG 

18996 GTAGTGGCTCATGCCTAAATCTCAGCACTTTGGGAGGCTGAGGCGGGAGGATCACCTGAG 
GTCGGGAGTTTGAGACCAGCCTGGGCAACATGGTGAAACCCCGTGTCTACTAAAAATATA 
AAAACTAGCCAGTCATGGTGGCACACACCTGTAATCCCGGCTACTTGGGAGGCTGAGGCA 
TGAGAATCACTTGAACCTGGGCTGCGGAGGTTGTAGTAAGCTGAGATCATGCCACTGTAC 
TCCAGCCTGGGTGACAGGGAGACATTCTCTCTCTCAAAAAAAAAAAAAAAACAAAAAAAA 
[A,C] 

AAAACAAACCAACAAAACAAAGTAATCCAGGAACAACAACATGATGAAGGACTGCATGCA
GGACTCAGTGATGGATGGTGGAAGACAGCCAGGAAGTTAAGCATGACTCTGGTATTAAGT 
GTTGTCTGGGAGAGTTAAGATTCCATTTACAGAAATAAGACCTGTAGGGGAAGCTCTTGA 
TTTTTTTTTTTTTGCAGACTGCTGATTTCCTGATTACATGTGTTAAGTTTGAGGTATAGA 
GAGAAAGAACATCCTGGCCGGGTGCAGTGGCTCACACCCGTAATCCCAGCACTTTGGGAG 

18996 GTAGTGGCTCATGCCTAAATCTCAGCACTTTGGGAGGCTGAGGCGGGAGGATCACCTGAG
GTCGGGAGTTTGAGACCAGCCTGGGCAACATGGTGAAACCCCGTGTCTACTAAAAATATA 
AAAACTAGCCAGTCATGGTGGCACACACCTGTAATCCCGGCTACTTGGGAGGCTGAGGCA 
TGAGAATCACTTGAACCTGGGCTGCGGAGGTTGTAGTAAGCTGAGATCATGCCACTGTAC 
TCCAGCCTGGGTGACAGGGAGACATTCTCTCTCTCAAAAAAAAAAAAAAAACAAAAAAAA
[A,C]

AAAACAAACCAACAAAACAAAGTAATCCAGGAACAACAACATGATGAAGGACTGCATGCA 
GGACTCAGTGATGGATGGTGGAAGACAGCCAGGAAGTTAAGCATGACTCTGGTATTAAGT 
GTTGTCTGGGAGAGTTAAGATTCCATTTACAGAAATAAGACCTGTAGGGGAAGCTCTTGA 
TTTTTTTTTTTTTGCAGACTGCTGATTTCCTGATTACATGTGTTAAGTTTGAGGTATAGA 
GAGAAAGAACATCCTGGCCGGGTGCAGTGGCTCACACCCGTAATCCCAGCACTTTGGGAG



20532 TAGCCATCATGGAATACTATGCACCCATTAAAAATAATGAAATAATGTCTTTGCAACAAC 
ATGAATGTAGCTGGAGGGCATTATCCTAAGCAAACTAACACAGAAACAGAAAACCAAATA 
CTGCGTGTTCTCACGCAGTGAGAGTGGGAGCTGAACATCAAGTACACATGGATGTAAAGA 
TGGCAACAATAGACATGGGTCTACTAGAGGTGGTGGTGCGGCAGGGTGGGGGTGGGGGTT 
GTGTGGCAGAGGAACAGCTGAAAAACTACCTATTTGATACTATACCCAGCACCTGGGAAA 
[C,T] 

GGGTTCAGTCATACCCCAAACCTCAGCATCACACAGTATACCTTTCTAACAAACTTACAC 
ATGTATTCTGTGATTCTAAAATAAACATTGAAAAATAAAAAAAAAACTGACATGGTTTGT 
ACTGTTTAATCTGACATAATGGCTAGGGGAAATGAAGTCTGCAGAATGGCTGTTTACGGA 
TGTTGTTGTTGTTGTTGAGATGAGGTCTCACTATGTTGCCCAGGCTTGAACTCCTGGCCT 
CAAGCAGTCCTCCTGCCTTGACCTCCCAAAATGTTGAGATTACATGCATGAGCCATTGCC 

21118 GCATGAGCCATTGCCAAAACGGCTATTTGGATTGCTGTTAAGGTTATTACATTCTCTGTG 

TAGTAAGACCTTGAAGGAGAAGGATTTGAGATCAGGAGTTTAAGAAAAAATGTTAATCTA 

GGAAGAGAGGATAATTTCTGTGTTTGGCCAGTTGCAATGGCTCACGCCTGTAATCCCAGC 

GTTTTGGGAGGCCGAGCTGGGCAGATCACTTGAGCTCAGGAGTTTGAGAAGAGCCTGGGC 

AACATGGTGAAGCCCCGTCTCTACTGAAAATACAAAAATTGGCTGGGCGTGGTGGCAGGT 
[G,T] 

CCCACTGTAATTTCAGCCACTCAGGAGACTGAGGCAGGAGAATTGCTTGAACCTGGAAAG 
TGGAGGATATGGTGAGCCTAGATCGCGCCACTCTACTCCAGCCTGGGAGACTCCATCTCA 
AAAAAAAAAGAAAACAAGATGCTGAAATGAAGTAATTACCACAGTCAATGTGATCCTATA 
ACTTTGTTTTCTTTTAGAGATGGGGTCTCCCTCTGTCACCCAGGCTGGAGTGCAGTGGTG 
CATCATAGCTTTCTGCAGCCTCCACCTCCTGGGCTCAGGTGGTCCTCTTGCCTCAGTGTT 

21211 AGGAGTTTAAGAAAAAATGTTAATCTAGGAAGAGAGGATAATTTCTGTGTTTGGCCAGTT 
GCAATGGCTCACGCCTGTAATCCCAGCGTTTTGGGAGGCCGAGCTGGGCAGATCACTTGA 
GCTCAGGAGTTTGAGAAGAGCCTGGGCAACATGGTGAAGCCCCGTCTCTACTGAAAATAC 
AAAAATTGGCTGGGCGTGGTGGCAGGTGCCCACTGTAATTTCAGCCACTCAGGAGACTGA 
GGCAGGAGAATTGCTTGAACCTGGAAAGTGGAGGATATGGTGAGCCTAGATCGCGCCACT 
[C,G] 

TACTCCAGCCTGGGAGACTCCATCTCAAAAAAAAAAGAAAACAAGATGCTGAAATGAAGT 
AATTACCACAGTCAATGTGATCCTATAACTTTGTTTTCTTTTAGAGATGGGGTCTCCCTC 
TGTCACCCAGGCTGGAGTGCAGTGGTGCATCATAGCTTTCTGCAGCCTCCACCTCCTGGG 
CTCAGGTGGTCCTCTTGCCTCAGTGTTCCGAGTAGTTAGGACTGACTGCAGGTGCATGCT 
GCTATGCCTGGCTAACTTTAAAATTTTTTTGTAGAGGCGGGGTCTTGCTATGTTGCCTAG 

21212 GGAGTTTAAGAAAAAATGTTAATCTAGGAAGAGAGGATAATTTCTGTGTTTGGCCAGTTG 
CAATGGCTCACGCCTGTAATCCCAGCGTTTTGGGAGGCCGAGCTGGGCAGATCACTTGAG 
CTCAGGAGTTTGAGAAGAGCCTGGGCAACATGGTGAAGCCCCGTCTCTACTGAAAATACA 
AAAATTGGCTGGGCGTGGTGGCAGGTGCCCACTGTAATTTCAGCCACTCAGGAGACTGAG 
GCAGGAGAATTGCTTGAACCTGGAAAGTGGAGGATATGGTGAGCCTAGATCGCGCCACTC 
[T,C] 

ACTCCAGCCTGGGAGACTCCATCTCAAAAAAAAAAGAAAACAAGATGCTGAAATGAAGTA 
ATTACCACAGTCAATGTGATCCTATAACTTTGTTTTCTTTTAGAGATGGGGTCTCCCTCT 
GTCACCCAGGCTGGAGTGCAGTGGTGCATCATAGCTTTCTGCAGCCTCCACCTCCTGGGC 
TCAGGTGGTCCTCTTGCCTCAGTGTTCCGAGTAGTTAGGACTGACTGCAGGTGCATGCTG 
CTATGCCTGGCTAACTTTAAAATTTTTTTGTAGAGGCGGGGTCTTGCTATGTTGCCTAGG 

21238 GGAAGAGAGGATAATTTCTGTGTTTGGCCAGTTGCAATGGCTCACGCCTGTAATCCCAGC 

GTTTTGGGAGGCCGAGCTGGGCAGATCACTTGAGCTCAGGAGTTTGAGAAGAGCCTGGGC 

AACATGGTGAAGCCCCGTCTCTACTGAAAATACAAAAATTGGCTGGGCGTGGTGGCAGGT 

GCCCACTGTAATTTCAGCCACTCAGGAGACTGAGGCAGGAGAATTGCTTGAACCTGGAAA 

GTGGAGGATATGGTGAGCCTAGATCGCGCCACTCTACTCCAGCCTGGGAGACTCCATCTC 
[-,A]

AAAAAAAAAGAAAACAAGATGCTGAAATGAAGTAATTACCACAGTCAATGTGATCCTATA 
ACTTTGTTTTCTTTTAGAGATGGGGTCTCCCTCTGTCACCCAGGCTGGAGTGCAGTGGTG 
CATCATAGCTTTCTGCAGCCTCCACCTCCTGGGCTCAGGTGGTCCTCTTGCCTCAGTGTT 
CCGAGTAGTTAGGACTGACTGCAGGTGCATGCTGCTATGCCTGGCTAACTTTAAAATTTT 
TTTGTAGAGGCGGGGTCTTGCTATGTTGCCTAGGCTGGTCTCCAACTCCTGATCTCAATC 

26483 CTGTAAGCATTGTATATACATTATCTTTTGTCATTGTGATGGGGTCTTCCTGGTTCCTGC
TAGTATTTATGTGCTTTTTTTTCCCCTCAAGACTGGAGCAGTTATTAGCCCCAATAGCCA

ATCATTAAGCCTAAATCCTAATTCACAGTAGCATTGTGGGCTTCCTGGATCCTCAGCCAG 

AATAGGGTTTTTACAACTTAACAATAAAAAATGAGACGTCAGAGGGGAAGTATAGTAACT 

AGTGTTGTTTTGATTAAGAAGGGGATGAAACACAAAAACCAAAAGAAGTCTGTGGAGGAG 
[G,A] 

AGGAGCTAGGGCATGTTCTTCTGAGACTTGAGCGAGAGGAACCTTGGGAGTGGGAGGTTG 
TGGGGAAGTTAGAGGCTGCAAGGGCTGTTGAGGTAGTGAGAGGGACGGATCCCATGAGGA 
GTCTGGCATGGGGGCTCTGATTTAGCCTCTTCCCTGCAGTGGACAATGAGGAGGCTGCAC 
TGCTGCATGAAGAGGCTACCATGACTATTGAAGAGCTGCTGACACGCTACGGGCAGAACT 
GTCACAAGGGCCCTCCCCACAGCAAATCTGGAGGTGGGACAGGCGAGGAACCAGGGTCCC 
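
The context entries above share one layout: a position label, lines of upstream flanking sequence, a bracketed allele list such as `[C,T]`, then lines of downstream flanking sequence. A minimal Python sketch for splitting one entry into those parts is shown below, as an illustration only. The function name `parse_context_block` is hypothetical, and the sketch assumes the caller has already separated the listing into one list of lines per entry.

```python
import re

# A bracketed allele list, e.g. "[C,T]" or "[T,-,G]".
ALLELES = re.compile(r"^\[([ACGT\-,\s]+)\]$")


def parse_context_block(lines):
    """Split one context entry into (position, upstream, alleles, downstream).

    Blank lines are skipped and whitespace inside sequence lines is removed.
    `alleles` stays None when the entry carries no bracketed allele line.
    """
    position = None
    alleles = None
    upstream, downstream = [], []
    for raw in lines:
        text = raw.strip()
        if not text:
            continue
        match = ALLELES.match(text)
        if match:
            alleles = [a.strip() for a in match.group(1).split(",")]
            continue
        tokens = text.split()
        # The first sequence line starts with the position label.
        if position is None and tokens[0].isdigit():
            position = int(tokens[0])
            tokens = tokens[1:]
        target = upstream if alleles is None else downstream
        target.append("".join(tokens))
    return position, "".join(upstream), alleles, "".join(downstream)
```

In the entries shown here each flank is five lines of 60 bases, so a parsed flank whose length differs from 300 may indicate a damaged line.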

GENE WISE ALIGNMENT: 

gi|4505999|ref| 1 MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSMEDAHNCIPEL

MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSME 
MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSMEVRRQGPIGW 

genomic 2134 aggtctccaagattggggggcccccctgttgacgtcgtaggaacgcagt 

tgcatcacactagcgagtgccgtctcagtcctagggtctatggagctgg 
gtccccgccgggccgcgcccgcgggccccccgacgcccgggggggcacg 

gi|4505999|ref| 50 DSETAMFSVYDGHGG EEVALYCA

++ + G EEVALYCA 
PLRGGNLTEKESGGW -:G[ggg] EEVALYCA 

genomic 2281 cccggacagagaggtGGTCCTCC Intron 1 TAGGGggggtttg 

ctgggatcaaagggg <1 [2327:25451]-1> aatctagc

gggcgtgggagcgag gatcgctc 

gi|4505999|ref| 73 KYLPDIIKDQKAYKEGKLQK ALEDAF

KYLPDIIKDQKAYKEGKLQK ALEDAF
KYLPDIIKDQKAYKEGKLQK ALEDAF

genomic 25478 atccgaaagcagtaggaccaGTCTGTC Intron 2 CAGgtgggt

aatcattaaaacaaagataa<0 [25538:25851]-0>ctaact

attttccatggccgacgagg taatcc 

gi|4505999|ref| 99 LAIDAKLTTEEVIKELAQIAGRPTEDEDEKEKVADEDD

LAIDAKLTTEEVIKELAQIAGRPTEDEDEKEKVADEDD 
LAIDAKLTTEEVIKELAQIAGRPTEDEDEKEKVADEDD 

genomic 25870 tgaggataagggaagcgcaggccagggggagagggggg 

tctacatccaattaatcatcggccaaaaaaaatcaaaa 
gttccagctaactaggagtagactgtataaaaattatt 

gi|4505999|ref| 137 DNEEAALLHEEATMTIEELLTRY

DNEEAALLHEEATMTIEELLTRY 
V: V [gtg] DNEEAALLHEEATMTIEELLTRY 
genomic 25984 GGTGAGTG Intron 3 CAGTGgaggggcccgggaaaaggccact 

<1 [25985:26642]-1> aaaaccttaaacctctaattcga

ctggtaggtagtcgttagggacc 

gi|4505999|ref| 161 GQNCHKGPPHSKSGGGTGEEPGSQGLNGEAGPEDSTRETPSQENGPTAK

GQNCHKGPPHSKSGGGTGEEPGSQGLNGEAGPEDSTRETPSQENGPTAK 
GQNCHKGPPHSKSGGGTGEEPGSQGLNGEAGPEDSTRETPSQENGPTAK 

genomic 26714 gcatcagcccaatgggagggcgtcgcaggggcggtaagactcgagcaga 

gaagaagccagacgggcgaacgcagtagacgcaaccgacccaaagccca 
ggctcgctcccatatgacgaagcgcctggaatgcatgattaaatccacg 

gi|4505999|ref| 210 AYTGFSSNSERGTEAGQVGEPGIPTGEAGPSCSSASDKLPRVAKSKFFE

AYTGFSSNSERGTEAGQVGEPGIPTGEAGPSCSSASDKLPRVAKSKFFE
AYTGFSSNSERGTEAGQVGEPGIPTGEAGPSCSSASDKLPRVAKSKFFE

genomic 26861 gtagtttatgcgagggcgggcgacaggggcttttgtgacccggatattg

cacgtccacaggcacgatgacgtccgacgccgccccaatcgtcacatta 
ccactcccgatgtgacattgtctcttgtgtcctactcggtattgcgctg 

gi|4505999|ref| 259 DSEDESDEAEEEEEDSE ECSEEEDGY
DSEDESDEAEEEEEDSE ECSEEEDGY
DSEDESDEAEEEEEDSE ECSEEEDGY
genomic 27008 gagggtgggggggggagGTAAGGG Intron 4 CAGgtagggggt

agaaacaacaaaaaaga<0 [27059:27638]-0>aggaaaaga

ctgtgatgggaagactg accgagtcc 

gi|4505999|ref| 285 SSEEAENEEDEDDTEEAEEDDEEEEEEMMVPGMEGKEE

SSEEAENEEDEDDTEEAEEDDEEEEEEMMVPGMEGKEE
SSEEAENEEDEDDTEEAEEDDEEEEEEMMVPGMEGKEE

genomic 27666 aaggggaggggggagggggggggggggaagcgaggagg

ggaacaaaaaaaacaacaaaaaaaaaatttcgtagaaa 
ctggagtgatgtccggtagctaaaaaggggaggacagg 

gi|4505999|ref| 323 PGSDSGTTAVVALIRGKQLIVANAGD

PGSDSGTTAVVALIRGKQLIVANAGD
PGSDSGTTAVVALIRGKQLIVANAGD

genomic 27780 GTGTGTG Intron 5 AAGcgtgagaaggggcacgactaggaggg

<0 [27780:28130]-0>cgcaggcccttcttggaatttcacga

tctcttaagggcgaaggggtaccaac 

gi|4505999|ref| 349 SRCVVSEAGKALDMSYDHKPEDEVELARIKNAGGKVTMDGRVNGGLNLS

SRCVVSEAGKALDMSYDHKPEDEVELARIKNAGGKVTMDGRVNGGLNLS
SRCVVSEAGKALDMSYDHKPEDEVELARIKNAGGKVTMDGRVNGGLNLS 

genomic 28209 tctggtgggagtgattgcacgggggcgcaaagggagaaggcgaggcact 

cggttcacgactatcaaaacaaatatcgtaacggatctaggtaggtatc 
tctgatgtcatacgcttcaagtaaaaaccgtttcgccgtgaccgccccc 

gi|4505999|ref| 398 RAI DHFYKRNKNLPPEEQMISAL

RAI DHFYKRNKNLPPEEQMISAL
RAI G:G[ggg] DHFYKRNKNLPPEEQMISAL

genomic 28356 agaGGTAAGGG Intron 6 CAGGGgcttaaaaacccggcaatgc

gct <1 [28366:29164]-1> aataagaaatccaaattcct

act ccctgacgcgatgaggtact 

gi I 4505999 | ref | 422 PDIKVLTLTDDHE FMVIACDGI n 

PDIKVLTLTDDHEFMVIACDGI N 
PDIKVLTLTDDHE FMVIACDGI W:W[tgg] N 

genomic 29227 cgaagcacaggcgtagagtggaTGGTGAGCA Intron 7 CAGGa 

catattctcaaaattttcgagt <2 [29295:29541]-2> a 

tccgggtctcctacgctcttcc t 

gi|4505999|ref| 446 VMSSQEVVDFIQSKISQRDENGELRLLSSIVEEL 

VMSSQEVVDFIQSKISQRDENGELRLLSSIVEE+ 
VMSSQEVVDFIQSKISQRDENGELRLLSSIVEEV 

genomic 29546 gaaacggggtactaaaccggaggccttttagggg 

ttggaattattacatgagaaagatgttccttaat 
ggccgatatctaagccgttatggtgagactgagg 



cDNA 



genomic 



cDNA 



genomic 



1 MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSMEDAHNCIPEL 
MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSME 
MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSMEVRRQGPIGW 
2134 aggtctccaagattggggggcccccctgttgacgtcgtaggaacgcagt 
tgcatcacactagcgagtgccgtctcagtcctagggtctatggagctgg 
gtccccgccgggccgcgcccgcgggccccccgacgcccgggggggcacg 

50 DSETAMFSVYDGHGG EEVALYCA 
++ + G EEVALYCA 
PLRGGNLTEKESGGW -:G[ggg] EEVALYCA 

2281 cccggacagagaggtGGTCCTCC Intron 1 TAGGGggggtttg 

ctgggatcaaagggg <1 [2327:25451]-1> aatctagc 

gggcgtgggagcgag gatcgctc 
cDNA 



genomic 



cDNA 



genomic 



cDNA 



genomic 



73 KYLPDIIKDQKAYKEGKLQK ALEDAF 
KYLPDIIKDQKAYKEGKLQK ALEDAF 
KYLPDIIKDQKAYKEGKLQK ALEDAF 
25478 atccgaaagcagtaggaccaGTCTGTC Intron 2 CAGgtgggt 

aatcattaaaacaaagataa <0 [25538:25851]-0> ctaact 

attttccatggccgacgagg taatcc 

99 LAIDAKLTTEEVIKELAQIAGRPTEDEDEKEKVADEDD 
LAIDAKLTTEEVIKELAQIAGRPTEDEDEKEKVADEDD 
LAIDAKLTTEEVIKELAQIAGRPTEDEDEKEKVADEDD 
25870 tgaggataagggaagcgcaggccagggggagagggggg 
tctacatccaattaatcatcggccaaaaaaaatcaaaa 
gttccagctaactaggagtagactgtataaaaattatt 

137 DHFYKRNKNLPPEEQMISALPDI 

DHFYKRNKNLPPEEQMISALPDI 
G:G[ggg] DHFYKRNKNLPPEEQMISALPDI 

25984 GGTGAGTG Intron 3 CAGGGgcttaaaaacccggcaatgccga 

<1 [25985:29164]-1> aataagaaatccaaattcctcat 

ccctgacgcgatgaggtacttcc 



cDNA 



genomic 



cDNA 



genomic 



161 KVLTLTDDHEFMVIACDGI NVMS 
KVLTLTDDHEFMVIACDGI NVMS 
KVLTLTDDHEFMVIACDGI W:W[tgg] NVMS 
29236 agcacaggcgtagagtggaTGGTGAGCA Intron 4 CAGGagaa 

attctcaaaattttcgagt <2 [29295:29541]-2> attg 

gggtctcctacgctcttcc tggc 



185 SQEVVDFIQSKISQRDENGELRLLSSIVEEL 
SQEVVDFIQSKISQRDENGELRLLSSIVEE+ 
SQEVVDFIQSKISQRDENGELRLLSSIVEEV 
29555 acggggtactaaaccggaggccttttagggg 
gaattattacatgagaaagatgttccttaat 
cgatatctaagccgttatggtgagactgagg 